EXPRESSION VECTORS FOR STIMULATING AN IMMUNE
RESPONSE AND METHODS OF USING THE SAME

CROSS-REFERENCES TO RELATED APPLICATIONS 
This application claims the benefit of 09/078,904, filed May 13, 1998, and
60/085,751, filed May 15, 1998, both herein incorporated by reference in their entirety.

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 

This invention was made with government support under NIH Grant No.
AI-42699-01, NIH Grant No. AI38584-03, and NIH Contract No. N01-AI-45241. The
Government has certain rights in this invention.

FIELD OF THE INVENTION 
The present invention relates to nucleic acid vaccines encoding multiple
CTL and HTL epitopes and MHC targeting sequences.

BACKGROUND OF THE INVENTION 
Vaccines are of fundamental importance in modern medicine and have
been highly effective in combating certain human diseases. However, despite the
successful implementation of vaccination programs that have greatly limited or virtually
eliminated several debilitating human diseases, there are a number of diseases that affect
millions worldwide for which effective vaccines have not been developed.

Major advances in the field of immunology have led to a greater
understanding of the mechanisms involved in the immune response and have provided
insights into developing new vaccine strategies (Kuby, Immunology, 443-457 (3rd ed.,
1997), which is incorporated herein by reference). These new vaccine strategies have
taken advantage of knowledge gained regarding the mechanisms by which foreign
material, termed antigen, is recognized by the immune system and eliminated from the
organism. An effective vaccine is one that elicits an immune response to an antigen of
interest.

Specialized cells of the immune system are responsible for the protective
activity required to combat diseases. An immune response involves two major groups of
cells: lymphocytes, or white blood cells, and antigen-presenting cells. The purpose of
these immune response cells is to recognize foreign material, such as an infectious
organism or a cancer cell, and remove that foreign material from the organism.

Two major types of lymphocytes mediate different aspects of the immune
response. B cells display on their cell surface specialized proteins, called antibodies, that
bind specifically to foreign material, called antigens. Effector B cells produce soluble
forms of the antibody, which circulate throughout the body and function to eliminate
antigen from the organism. This branch of the immune system is known as the humoral
branch. Memory B cells function to recognize the antigen in future encounters by
continuing to express the membrane-bound form of the antibody.

A second major type of lymphocyte is the T cell. T cells also have on their
cell surface specialized proteins that recognize antigen but, in contrast to B cells, require
that the antigen be bound to a specialized membrane protein complex, the major
histocompatibility complex (MHC), on the surface of an antigen-presenting cell. Two
major classes of T cells, termed helper T lymphocytes ("HTL") and cytotoxic T
lymphocytes ("CTL"), are often distinguished based on the presence of either CD4 or
CD8 protein, respectively, on the cell surface. This branch of the immune system is
known as the cell-mediated branch.

The second major class of immune response cells consists of cells that function in
antigen presentation by processing antigen for binding to MHC molecules expressed in
the antigen-presenting cells. The processed antigen bound to MHC molecules is
transferred to the surface of the cell, where the antigen-MHC complex is available to bind
to T cells.

MHC molecules can be divided into MHC class I and class II molecules
and are recognized by the two classes of T cells. Nearly all cells express MHC class I
molecules, which function to present antigen to cytotoxic T lymphocytes. Cytotoxic T
lymphocytes typically recognize antigen bound to MHC class I. A subset of cells called
antigen-presenting cells express MHC class II molecules. Helper T lymphocytes
typically recognize antigen bound to MHC class II molecules. Antigen-presenting cells
include dendritic cells, macrophages, B cells, fibroblasts, glial cells, pancreatic beta cells,
thymic epithelial cells, thyroid epithelial cells and vascular endothelial cells. These
antigen-presenting cells generally express both MHC class I and class II molecules. Also,
B cells function as both antibody-producing and antigen-presenting cells.

Once a helper T lymphocyte recognizes an antigen-MHC class II complex
on the surface of an antigen-presenting cell, the helper T lymphocyte becomes activated
and produces growth factors that activate a variety of cells involved in the immune
response, including B cells and cytotoxic T lymphocytes. For example, under the
influence of growth factors expressed by activated helper T lymphocytes, a cytotoxic T
lymphocyte that recognizes an antigen-MHC class I complex becomes activated. CTLs
monitor and eliminate cells that display antigen specifically recognized by the CTL, such
as infected cells or tumor cells. Thus, activation of helper T lymphocytes stimulates the
activation of both the humoral and cell-mediated branches of the immune system.

WO 99/58658    PCT/US99/10646

An important aspect of the immune response, in particular as it relates to
vaccine efficacy, is the manner in which antigen is processed so that it can be recognized
by the specialized cells of the immune system. Distinct antigen processing and
presentation pathways are utilized. One is a cytosolic pathway, which results in the
antigen being bound to MHC class I molecules. An alternative pathway is an
endoplasmic reticulum pathway, which bypasses the cytosol. Another is an endocytic
pathway, which results in the antigen being bound to MHC class II molecules. Thus, the
cell surface presentation of a particular antigen by a MHC class II or class I molecule to a
helper T lymphocyte or a cytotoxic T lymphocyte, respectively, is dependent on the
processing pathway for that antigen.

The cytosolic pathway processes endogenous antigens that are expressed
inside the cell. The antigen is degraded by a specialized protease complex in the cytosol
of the cell, and the resulting antigen peptides are transported into the endoplasmic
reticulum, an organelle that processes cell surface molecules. In the endoplasmic
reticulum, the antigen peptides bind to MHC class I molecules, which are then
transported to the cell surface for presentation to cytotoxic T lymphocytes of the immune
system.

Antigens that exist outside the cell are processed by the endocytic
pathway. Such antigens are taken into the cell by endocytosis, which brings the antigens
into specialized vesicles called endosomes and subsequently to specialized vesicles called
lysosomes, where the antigen is degraded by proteases into antigen peptides that bind to
MHC class II molecules. The antigen peptide-MHC class II molecule complex is then
transported to the cell surface for presentation to helper T lymphocytes of the immune
system.

A variety of factors must be considered in the development of an effective
vaccine. For example, the extent of activation of either the humoral or cell-mediated
branch of the immune system can determine the effectiveness of a vaccine against a
particular disease. Furthermore, the development of immunologic memory by inducing
memory-cell formation can be important for an effective vaccine against a particular
disease (Kuby, supra). For example, protection from infectious diseases caused by
pathogens with short incubation periods, such as influenza virus, requires high levels of
neutralizing antibody generated by the humoral branch because disease symptoms are
already underway before memory cells are activated. Alternatively, protection from
infectious diseases caused by pathogens with long incubation periods, such as polio virus,
does not require neutralizing antibodies at the time of infection but instead requires
memory B cells that can generate neutralizing antibodies to combat the pathogen before it
is able to infect target tissues. Therefore, the effectiveness of a vaccine at preventing or
ameliorating the symptoms of a particular disease depends on the type of immune
response generated by the vaccine.

Many traditional vaccines have relied on intact pathogens such as
attenuated or inactivated viruses or bacteria to elicit an immune response. However,
these traditional vaccines have advantages and disadvantages, including reversion of an
attenuated pathogen to a virulent form. The problem of reversion of an attenuated
vaccine has been addressed by the use of molecules of the pathogen rather than the whole
pathogen. For example, immunization approaches have begun to incorporate
recombinant vector vaccines and synthetic peptide vaccines (Kuby, supra). Recently,
DNA vaccines have also been used (Donnelly et al., Annu. Rev. Immunol. 15:617-648
(1997), which is incorporated herein by reference). The use of molecules of a pathogen
provides safe vaccines that circumvent the potential for reversion to a virulent form of the
vaccine.

The targeting of antigens to MHC class II molecules to activate helper T
lymphocytes has been described using lysosomal targeting sequences, which direct
antigens to lysosomes, where the antigen is digested by lysosomal proteases into antigen
peptides that bind to MHC class II molecules (U.S. Patent No. 5,633,234; Thomson et al.,
J. Virol. 72:2246-2252 (1998)). It would be advantageous to develop vaccines that
deliver multiple antigens while exploiting the safety provided by administering individual
epitopes of a pathogen rather than a whole organism. In particular, it would be
advantageous to develop vaccines that effectively target antigens to MHC class II
molecules for activation of helper T lymphocytes.

Several studies also point to the crucial role of cytotoxic T cells in both
production and eradication of infectious diseases and cancer by the immune system
(Byrne et al., J. Immunol. 51:682 (1984); McMichael et al., N. Engl. J. Med. 309:13
(1983)). Recombinant protein vaccines do not reliably induce CTL responses, and the
use of otherwise immunogenic vaccines consisting of attenuated pathogens in humans is
hampered, in the case of several important diseases, by overriding safety concerns. In the
case of diseases such as HIV, HBV, HCV, and malaria, it appears desirable not only to
induce a vigorous CTL response, but also to focus the response against highly conserved
epitopes in order to prevent escape by mutation and overcome variable vaccine efficacy
against different isolates of the target pathogen.

Induction of a broad response directed simultaneously against multiple
epitopes also appears to be crucial for development of efficacious vaccines. HIV
infection is perhaps the best example where an infected host may benefit from a
multispecific response. Rapid progression of HIV infection has been reported in cases
where a narrowly focused CTL response is induced, whereas nonprogressors tend to show
a broader specificity of CTLs (Goulder et al., Nat. Med. 3:212 (1997); Borrow et al., Nat.
Med. 3:205 (1997)). The highly variable nature of HIV CTL epitopes resulting from a
highly mutating genome and selection by CTL responses directed against only a single or
few epitopes also supports the need for broad epitope CTL responses (McMichael et al.,
Annu. Rev. Immunol. 15:271 (1997)).

One potential approach to induce multispecific responses against
conserved epitopes is immunization with a minigene plasmid encoding the epitopes in a
string-of-beads fashion. Induction of CTL, HTL, and B cell responses in mice by
minigene plasmids has been described by several laboratories using constructs encoding
as many as 11 epitopes (An et al., J. Virol. 71:2292 (1997); Thomson et al., J. Immunol.
157:822 (1996); Whitton et al., J. Virol. 67:348 (1993); Hanke et al., Vaccine 16:426
(1998); Vitiello et al., Eur. J. Immunol. 27:671-678 (1997)). Minigenes have been
delivered in vivo by infection with recombinant adenovirus or vaccinia, or by injection of
purified DNA via the intramuscular or intradermal route (Thomson et al., J. Immunol.
160:1717 (1998); Toes et al., Proc. Natl. Acad. Sci. USA 94:14660 (1997)).

Successful development of minigene DNA vaccines for human use will
require addressing certain fundamental questions dealing with epitope MHC affinity,
optimization of constructs for maximum in vivo immunogenicity, and development of
assays for testing in vivo potency of multi-epitope minigene constructs. Regarding MHC
binding affinity of epitopes, it is not currently known whether both high and low affinity
epitopes can be included within a single minigene construct, and what ranges of peptide
affinity are permissible for CTL induction in vivo. This is especially important because
dominant epitopes can vary in their affinity and because it might be important to be able
to deliver mixtures of dominant and subdominant epitopes that are characterized by high
and low MHC binding affinities.

With respect to minigene construct optimization for maximum
immunogenicity in vivo, conflicting data exist regarding whether the exact position of
the epitopes in a given construct or the presence of flanking regions, helper T cell
epitopes, and signal sequences might be crucial for CTL induction (Del Val et al., Cell
66:1145 (1991); Bergmann et al., J. Virol. 68:5306 (1994); Thomson et al., Proc. Natl.
Acad. Sci. USA 92:5845 (1995); Shirai et al., J. Infect. Dis. 173:24 (1996); Rahemtulla et
al., Nature 353:180 (1991); Jennings et al., Cell. Immunol. 133:234 (1991); Anderson et
al., J. Exp. Med. 174:489 (1991); Uger et al., J. Immunol. 158:685 (1997)). Finally,
regarding development of assays that allow testing of human vaccine candidates, it should
be noted that, to date, all in vivo immunogenicity studies of multi-epitope minigene plasmids
have been performed with murine class I MHC-restricted epitopes. It would be
advantageous to be able to test the in vivo immunogenicity of minigenes containing
human CTL epitopes in a convenient animal model system.

Thus, there exists a need to develop methods to effectively deliver a
variety of HTL (helper T lymphocyte) and CTL (cytotoxic T lymphocyte) antigens to
stimulate an immune response. The present invention satisfies this need and provides
related advantages as well.

SUMMARY OF THE INVENTION 
The invention therefore provides expression vectors encoding two or more
HTL epitopes fused to a MHC class II targeting sequence, as well as expression vectors
encoding a CTL epitope and a universal HTL epitope fused to an MHC class I targeting
sequence. The HTL epitope can be a universal HTL epitope (also referred to as a
universal MHC class II epitope). The invention also provides expression vectors
encoding two or more HTL epitopes fused to a MHC class II targeting sequence and
encoding one or more CTL epitopes. The invention additionally provides methods of
stimulating an immune response by administering an expression vector of the invention in
vivo, as well as methods of assaying the human immunogenicity of a human T cell
peptide epitope in vivo in a non-human mammal.

In one aspect, the present invention provides an expression vector
comprising a promoter operably linked to a first nucleotide sequence encoding a major
histocompatibility (MHC) targeting sequence fused to a second nucleotide sequence
encoding two or more heterologous peptide epitopes, wherein the heterologous peptide
epitopes comprise two HTL peptide epitopes or a CTL peptide epitope and a universal
HTL peptide epitope.
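The epitope requirement recited in this aspect can be read as a simple logical condition. The following Python sketch is illustrative only: the class names, field names, and method name are hypothetical and do not come from this disclosure, and the check reflects one plain reading of "two HTL peptide epitopes or a CTL peptide epitope and a universal HTL peptide epitope."

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class Epitope:
    name: str                # hypothetical label chosen by the user
    kind: str                # "HTL" or "CTL"
    universal: bool = False  # True for a universal HTL epitope (e.g., a pan DR epitope)


@dataclass
class ExpressionVector:
    promoter: str
    mhc_targeting_sequence: str
    epitopes: List[Epitope] = field(default_factory=list)

    def satisfies_epitope_requirement(self) -> bool:
        """Assumed reading: at least two epitopes, including either two HTL
        epitopes, or a CTL epitope together with a universal HTL epitope."""
        if len(self.epitopes) < 2:
            return False
        htl = [e for e in self.epitopes if e.kind == "HTL"]
        ctl = [e for e in self.epitopes if e.kind == "CTL"]
        two_htl = len(htl) >= 2
        ctl_plus_universal_htl = bool(ctl) and any(e.universal for e in htl)
        return two_htl or ctl_plus_universal_htl
```

Under this reading, a vector carrying only CTL epitopes, or a single HTL epitope, would not meet the condition, whereas a vector carrying one CTL epitope plus a pan DR epitope would.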

In another aspect, the present invention provides a method of inducing an
immune response in vivo comprising administering to a mammalian subject an expression
vector comprising a promoter operably linked to a first nucleotide sequence encoding a
major histocompatibility (MHC) targeting sequence fused to a second nucleotide
sequence encoding two or more heterologous peptide epitopes, wherein the heterologous
peptide epitopes comprise two HTL peptide epitopes or a CTL peptide epitope and a
universal HTL peptide epitope.

In another aspect, the present invention provides a method of inducing an
immune response in vivo comprising administering to a mammalian subject an expression
vector comprising a promoter operably linked to a first nucleotide sequence encoding a
major histocompatibility (MHC) targeting sequence fused to a second nucleotide
sequence encoding a heterologous human HTL peptide epitope.

In another aspect, the present invention provides a method of assaying the
human immunogenicity of a human T cell peptide epitope in vivo in a non-human
mammal, comprising the step of administering to the non-human mammal an expression
vector comprising a promoter operably linked to a first nucleotide sequence encoding a
heterologous human CTL or HTL peptide epitope.

In one embodiment, the heterologous peptide epitopes comprise two or
more heterologous HTL peptide epitopes. In another embodiment, the heterologous
peptide epitopes comprise a CTL peptide epitope and a universal HTL peptide epitope.
In another embodiment, the heterologous peptide epitopes further comprise one to two or
more heterologous CTL peptide epitopes. In another embodiment, the expression vector
comprises both HTL and CTL peptide epitopes.

In one embodiment, one of the HTL peptide epitopes is a universal HTL
epitope. In another embodiment, the universal HTL epitope is a pan DR epitope. In
another embodiment, the pan DR epitope has the sequence
AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38).

In one embodiment, the peptide epitopes are hepatitis B virus epitopes,
hepatitis C virus epitopes, human immunodeficiency virus epitopes, human papilloma
virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes, PAP epitopes, p53
epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. In another
embodiment, the peptide epitopes each have a sequence selected from the group
consisting of the peptides depicted in Tables 1-8. In another embodiment, at least one of
the peptide epitopes is an analog of a peptide depicted in Tables 1-8.

In one embodiment, the MHC targeting sequence comprises a region of a
polypeptide selected from the group consisting of the Ii protein, LAMP-1, HLA-DM,
HLA-DO, H2-DO, influenza matrix protein, hepatitis B surface antigen, hepatitis B virus
core antigen, Ty particle, Ig-α protein, Ig-β protein, and Ig kappa chain signal sequence.

In one embodiment, the expression vector further comprises a second
promoter sequence operably linked to a third nucleotide sequence encoding one or more
heterologous HTL or CTL peptide epitopes. In another embodiment, the CTL peptide
epitope comprises a structural motif for an HLA supertype, whereby the peptide CTL
epitope binds to two or more members of the supertype with an affinity of greater than
500 nM. In another embodiment, the CTL peptide epitopes have structural motifs that
provide binding affinity for more than one HLA allele supertype.

In one embodiment, the non-human mammal is a transgenic mouse that
expresses a human HLA allele. In another embodiment, the human HLA allele is selected
from the group consisting of A11 and A2.1. In another embodiment, the non-human
mammal is a macaque that expresses a human HLA allele.

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows the nucleotide and amino acid sequences (SEQ ID NOS:1
and 2, respectively) of the IiPADRE construct encoding a fusion of the murine Ii gene
with a pan DR epitope sequence substituted for the CLIP sequence of the Ii protein.

Figure 2 shows the nucleotide and amino acid sequences (SEQ ID NOS:3
and 4, respectively) of the I80T construct encoding a fusion of the cytoplasmic domain,
the transmembrane domain and part of the luminal domain of the Ii protein fused to
multiple MHC class II epitopes.

Figure 3 shows the nucleotide and amino acid sequences (SEQ ID NOS:5
and 6, respectively) of the IiThfull construct encoding a fusion of the cytoplasmic
domain, transmembrane domain and a portion of the luminal domain of the Ii protein
fused to multiple T helper epitopes and amino acid residues 101 to 215 of the Ii protein,
which encodes the trimerization region of the Ii protein.

Figure 4 shows the nucleotide and amino acid sequences (SEQ ID NOS:7
and 8, respectively) of the KappaLAMP-Th construct encoding a fusion of the murine
immunoglobulin kappa signal sequence fused to multiple T helper epitopes and the
transmembrane and cytoplasmic domains of LAMP-1.

Figure 5 shows the nucleotide and amino acid sequences (SEQ ID NOS:9
and 10, respectively) of the H2M-Th construct encoding a fusion of the signal sequence
of H2-M fused to multiple MHC class II epitopes and the transmembrane and
cytoplasmic domains of H2-M.

Figure 6 shows the nucleotide and amino acid sequences (SEQ ID NOS:11
and 12, respectively) of the H2O-Th construct encoding a fusion of the signal sequence of
H2-DO fused to multiple MHC class II epitopes and the transmembrane and cytoplasmic
domains of H2-DO.

Figure 7 shows the nucleotide and amino acid sequences (SEQ ID NOS:13
and 14, respectively) of the PADRE-Influenza matrix construct encoding a fusion of a
pan DR epitope sequence fused to the amino-terminus of influenza matrix protein.

Figure 8 shows the nucleotide and amino acid sequences (SEQ ID NOS:15
and 16, respectively) of the PADRE-HBV-s construct encoding a fusion of a pan DR
epitope sequence fused to the amino-terminus of hepatitis B virus surface antigen.

Figure 9 shows the nucleotide and amino acid sequences (SEQ ID NOS:17
and 18, respectively) of the Ig-alphaTh construct encoding a fusion of the signal sequence
of the Ig-α protein fused to multiple MHC class II epitopes and the transmembrane and
cytoplasmic domains of the Ig-α protein.

Figure 10 shows the nucleotide and amino acid sequences (SEQ ID
NOS:19 and 20, respectively) of the Ig-betaTh construct encoding a fusion of the signal
sequence of the Ig-β protein fused to multiple MHC class II epitopes and the
transmembrane and cytoplasmic domains of the Ig-β protein.

Figure 11 shows the nucleotide and amino acid sequences (SEQ ID
NOS:21 and 22, respectively) of the SigTh construct encoding a fusion of the signal
sequence of the kappa immunoglobulin fused to multiple MHC class II epitopes.

Figure 12 shows the nucleotide and amino acid sequences (SEQ ID
NOS:23 and 24, respectively) of human HLA-DR, the invariant chain (Ii) protein.

Figure 13 shows the nucleotide and amino acid sequences (SEQ ID
NOS:25 and 26, respectively) of human lysosomal membrane glycoprotein-1 (LAMP-1).

Figure 14 shows the nucleotide and amino acid sequences (SEQ ID
NOS:27 and 28, respectively) of human HLA-DMB.

Figure 15 shows the nucleotide and amino acid sequences (SEQ ID
NOS:29 and 30, respectively) of human HLA-DO beta.

Figure 16 shows the nucleotide and amino acid sequences (SEQ ID
NOS:31 and 32, respectively) of the human MB-1 Ig-α.

Figure 17 shows the nucleotide and amino acid sequences (SEQ ID
NOS:33 and 34, respectively) of human Ig-β protein.

Figure 18 shows a schematic diagram depicting the method of generating
some of the constructs encoding a MHC class II targeting sequence fused to multiple
MHC class II epitopes.

Figure 19 shows the nucleotide sequence of the vector pEP2 (SEQ ID
NO:35).

Figure 20 shows the nucleotide sequence of the vector pMIN.0 (SEQ ID
NO:36).

Figure 21 shows the nucleotide sequence of the vector pMIN.1 (SEQ ID
NO:37).

Figure 22. Representative CTL responses in HLA-A2.1/Kb-H-2bxs mice
immunized with pMin.1 DNA. Splenocytes from primed animals were cultured in
triplicate flasks and stimulated twice in vitro with each peptide epitope. Cytotoxicity of
each culture was assayed in a 51Cr release assay against Jurkat-A2.1/Kb target cells in the
presence (filled symbols, solid lines) or absence (open symbols, dotted lines) of peptide.
Each symbol represents the response of a single culture.

Figure 23. Presentation of viral epitopes to specific CTLs by Jurkat-A2.1/Kb
tumor cells transfected with DNA minigene. Two constructs were used for
transfection, pMin.1 and pMin.2-GFP. pMin.2-GFP-transfected target cells were sorted
by FACS and the population used in this experiment contained 60% fluorescent cells.
CTL stimulation was measured by quantitating the amount of IFN-γ release (A, B) or by
lysis of 51Cr-labeled target cells (C, D, hatched bars). CTLs were stimulated with
transfected cells (A, C) or with parental Jurkat-A2.1/Kb cells in the presence of 1 ng/ml
peptide (B, D). Levels of IFN-γ release and cytotoxicity for the different CTL lines in
the absence of epitope ranged from 72-126 pg/ml and 2-6%, respectively.
Figure 24. Summary of modified minigene constructs used to address
variables critical for in vivo immunogenicity. The following modifications were
incorporated into the prototype pMin.1 construct: A, deletion of PADRE HTL epitope; B,
incorporation of the native HBV Pol 551 epitope that contains an alanine in position 9; C,
deletion of the Ig kappa signal sequence; and D, switching position of the HBV Env 335
and HBV Pol 455 epitopes.

Figure 25. Examination of variables that may influence pMin.1
immunogenicity. In vivo CTL-inducing activity of pMin.1 is compared to modified
constructs. For ease of comparison, the CTL response induced by each of the modified
DNA minigene constructs (shaded bars) is compared separately in each of the four panels
to the response induced by the prototype pMin.1 construct (solid bars). The geometric
mean responses of CTL-positive cultures from two to five independent experiments are
shown. Numbers shown with each bar indicate the number of positive cultures/total
number tested for that particular epitope. The ratio of positive cultures/total tested for the
pMin.1 group is shown in panel A and is the same for the remaining Figure panels (see
Example V, Materials and Methods, in vitro CTL cultures, for the definition of a positive
CTL culture). Theradigm responses were obtained by immunizing animals with the
lipopeptide and stimulating and testing splenocyte cultures with the HBV Core 18-27
peptide.

DEFINITIONS 

An "HTL" peptide epitope or an "MHC II epitope" is an MHC class II
restricted epitope, i.e., one that is bound by an MHC class II molecule.

A "CTL" peptide epitope or an "MHC I epitope" is an MHC class I
restricted epitope, i.e., one that is bound by an MHC class I molecule.

An "MHC targeting sequence" refers to a peptide sequence that targets a
polypeptide, e.g., comprising a peptide epitope, to a cytosolic pathway (e.g., an MHC
class I antigen processing pathway), an endoplasmic reticulum pathway, or an endocytic
pathway (e.g., an MHC class II antigen processing pathway).

The term "heterologous" when used with reference to portions of a nucleic
acid indicates that the nucleic acid comprises two or more subsequences that are not
found in the same relationship to each other in nature. For instance, the nucleic acid is
typically recombinantly produced, having two or more sequences from unrelated genes
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a
coding region from another source. Similarly, a heterologous protein indicates that the
protein comprises two or more subsequences that are not found in the same relationship to
each other in nature, e.g., a fusion polypeptide comprising subsequences from different
polypeptides, peptide epitopes from the same polypeptide that are not naturally in an
adjacent position, or repeats of a single peptide epitope.

As used herein, the term "universal MHC class II epitope" or a "universal
HTL epitope" refers to a MHC class II peptide epitope that binds to gene products of
multiple MHC class II alleles. For example, the DR, DP and DQ alleles are human MHC
II alleles. Generally, a unique set of peptides binds to a particular gene product of a MHC
class II allele. In contrast, a universal MHC class II epitope is able to bind to gene
products of multiple MHC class II alleles. A universal MHC class II epitope binds to 2 or
more MHC class II alleles, generally 3 or more MHC class II alleles, and particularly 5 or
more MHC class II alleles. Thus, the presence of a universal MHC class II epitope in an
expression vector is advantageous in that it functions to increase the number of allelic
MHC class II molecules that can bind to the peptide and, consequently, the number of
helper T lymphocytes that are activated.
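The allele-count thresholds in this definition (2 or more, generally 3 or more, particularly 5 or more) can be expressed as a small lookup. The sketch below is illustrative only; the function name and the returned labels are hypothetical and are not terms used in this disclosure.

```python
def universal_epitope_tier(n_alleles_bound: int) -> str:
    """Map the number of MHC class II alleles whose gene products bind a
    peptide onto the tiers named in the definition above.

    The tier labels are illustrative assumptions, not terms from the source.
    """
    if n_alleles_bound < 0:
        raise ValueError("allele count cannot be negative")
    if n_alleles_bound >= 5:
        return "universal (5 or more alleles)"
    if n_alleles_bound >= 3:
        return "universal (3 or more alleles)"
    if n_alleles_bound >= 2:
        return "universal (2 or more alleles)"
    return "not universal"
```

A peptide that binds the product of only one allele falls outside the definition, which matches the contrast drawn above with the unique set of peptides bound by a particular allele product.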

Universal MHC class II epitopes are well known in the art and include, for
example, epitopes such as the "pan DR epitopes," also referred to as "PADRE"
(Alexander et al., Immunity 1:751-761 (1994); WO 95/07707, USSN 60/036,713, USSN
60/037,432, PCT/US98/01373, 09/009,953, and USSN 60/087,192, each of which is
incorporated herein by reference). A "pan DR binding peptide" or a "PADRE" peptide of
the invention is a peptide capable of binding at least about 7 different DR molecules,
preferably 7 of the 12 most common DR molecules, most preferably 9 of the 12 most
common DR molecules (DR1, 2w2b, 2w2a, 3, 4w4, 4w14, 5, 7, 52a, 52b, 52c, and 53), or
alternatively, 50% of a panel of DR molecules representative of greater than or equal to
75% of the human population, preferably greater than or equal to 80% of the human
population. Pan DR epitopes can bind to a number of DR alleles and are strongly
immunogenic for T cells. For example, pan DR epitopes were found to be more effective
at inducing an immune response than natural MHC class II epitopes (Alexander, supra).

An example of a PADRE epitope is the peptide
AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38) (for additional
examples of PADRE epitopes, see Table 8 of TTC docket No. 018623-006221, filed May
12, 1999, USSN , herein incorporated by reference in its entirety).
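For readers who work with one-letter amino acid codes, the following Python sketch converts the three-letter sequence given above for SEQ ID NO:38 and applies the "7 of the 12 most common DR molecules" criterion as a simple count. It is illustrative only: the function names are hypothetical, the DR panel is copied from the list above, and any set of bound molecules passed to `is_pan_dr` would have to come from the user's own binding data, since none is given here.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# The 12 most common DR molecules as listed in the text above.
COMMON_DR_PANEL = frozenset(
    ["DR1", "2w2b", "2w2a", "3", "4w4", "4w14", "5", "7", "52a", "52b", "52c", "53"]
)


def to_one_letter(seq3: str) -> str:
    """Convert a run-together three-letter sequence (e.g. 'AlaLys') to one-letter codes."""
    codes = re.findall(r"[A-Z][a-z]{2}", seq3)
    if "".join(codes) != seq3:
        raise ValueError("sequence is not a clean run of three-letter codes")
    return "".join(THREE_TO_ONE[code] for code in codes)


def is_pan_dr(bound_molecules, minimum: int = 7) -> bool:
    """True if at least `minimum` of the 12 common DR molecules are bound.

    `bound_molecules` is user-supplied (hypothetical) binding data.
    """
    return len(COMMON_DR_PANEL & set(bound_molecules)) >= minimum


padre_one_letter = to_one_letter("AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla")
```

Raising `minimum` to 9 corresponds to the "most preferably" criterion stated above.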
With regard to a particular amino acid sequence, an "epitope" is a set of
amino acid residues which is involved in recognition by a particular immunoglobulin, or
in the context of T cells, those residues necessary for recognition by T cell receptor
proteins and/or Major Histocompatibility Complex (MHC) receptors. In an immune
system setting, in vivo or in vitro, an epitope is the collective features of a molecule, such
as primary, secondary and tertiary peptide structure, and charge, that together form a site
recognized by an immunoglobulin, T cell receptor or HLA molecule. Throughout this
disclosure epitope and peptide are often used interchangeably. It is to be appreciated,
however, that isolated or purified protein or peptide molecules larger than and comprising
an epitope of the invention are still within the bounds of the invention.

As used herein, "high affinity" with respect to HLA class I molecules is
defined as binding with an IC50 (or Kd) of less than 50 nM. "Intermediate affinity" is
binding with an IC50 (or Kd) of between about 50 and about 500 nM. "High affinity"
with respect to binding to HLA class II molecules is defined as binding with a Kd of
less than 100 nM. "Intermediate affinity" is binding with a Kd of between about 100 and
about 1000 nM. Assays for determining binding are described in detail, e.g., in PCT
publications WO 94/20127 and WO 94/03205. Alternatively, binding is expressed
relative to a reference peptide. As a particular assay becomes more, or less, sensitive, the
IC50s of the peptides tested may change somewhat. However, the binding relative to the
reference peptide will not significantly change. For example, in an assay run under
conditions such that the IC50 of the reference peptide increases 10-fold, the IC50 values
of the test peptides will also shift approximately 10-fold. Therefore, to avoid ambiguities,
the assessment of whether a peptide is a good, intermediate, weak, or negative binder is
generally based on its IC50, relative to the IC50 of a standard peptide.
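
The affinity categories and the reference-relative measure described above can be sketched as follows. This Python example is illustrative only and is not part of the original disclosure. The thresholds are those stated in the preceding paragraph; the function names, the exact handling of boundary values (the text says "about"), and the sample IC50 values are assumptions.

```python
# (high, intermediate) upper bounds in nM, as stated in the definitions above.
THRESHOLDS_NM = {"I": (50.0, 500.0), "II": (100.0, 1000.0)}


def affinity_class(ic50_nm, hla_class):
    """Classify a binding value for HLA class "I" or "II"."""
    high, intermediate = THRESHOLDS_NM[hla_class]
    if ic50_nm < high:
        return "high"
    if ic50_nm <= intermediate:
        return "intermediate"
    return "below intermediate"


def relative_binding(test_ic50_nm, reference_ic50_nm):
    """Reference IC50 divided by test IC50; values above 1 bind better than the reference."""
    return reference_ic50_nm / test_ic50_nm


print(affinity_class(20, "I"))     # high
print(affinity_class(300, "I"))    # intermediate
print(affinity_class(300, "II"))   # intermediate
print(affinity_class(5000, "II"))  # below intermediate

# If assay conditions shift every IC50 10-fold, the relative value is unchanged.
print(relative_binding(20, 5) == relative_binding(200, 50))  # True
```

The last line mirrors the point made in the text: absolute IC50 values move with assay sensitivity, while the ratio to the reference peptide does not.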

Throughout this disclosure, results are expressed in terms of "IC50s."
IC50 is the concentration of peptide in a binding assay at which 50% inhibition of binding
of a reference peptide is observed. Given the conditions in which the assays are run (i.e.,
limiting HLA proteins and labeled peptide concentrations), these values approximate KD
values. It should be noted that IC50 values can change, often dramatically, if the assay
conditions are varied, and depending on the particular reagents used (e.g., HLA
preparation, etc.). For example, excessive concentrations of HLA molecules will increase
the apparent measured IC50 of a given ligand.

The terms "identical" or percent "identity," in the context of two or more
peptide sequences, refer to two or more sequences or subsequences that are the same or
have a specified percentage of amino acid residues that are the same, when compared and
aligned for maximum correspondence over a comparison window, as measured using a
sequence comparison algorithm with default program parameters or by manual
alignment and visual inspection.
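
As a simplified illustration of percent identity, the following Python sketch compares two sequences that have already been aligned to the same length. It is not part of the original disclosure and does not perform alignment itself; a sequence comparison algorithm or manual alignment, as described above, is assumed to have been applied first. The function name and the variant sequence are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of positions with identical residues in two pre-aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


padre = "AKFVAAWTLKAAA"    # SEQ ID NO:38 in one-letter code
variant = "AKFVAAYTLKAAA"  # hypothetical single substitution at position 7

print(percent_identity(padre, padre))               # 100.0
print(round(percent_identity(padre, variant), 1))   # 92.3
```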

The phrases "isolated" or "biologically pure" refer to material which is
substantially or essentially free from components which normally accompany the material
as it is found in its native state. Thus, isolated peptides in accordance with the invention
preferably do not contain materials normally associated with the peptides in their in situ
environment.

"Major histocompatibility complex" or "MHC" is a cluster of genes that
plays a role in control of the cellular interactions responsible for physiologic immune
responses. In humans, the MHC complex is also known as the HLA complex. For a
detailed description of the MHC and HLA complexes, see Paul, Fundamental
Immunology (3rd ed. 1993).

"Human leukocyte antigen" or "HLA" is a human class I or class II major
histocompatibility complex (MHC) protein (see, e.g., Stites et al., Immunology (8th ed.,
1994)).

An "HLA supertype or family", as used herein, describes sets of HLA
molecules grouped on the basis of shared peptide-binding specificities. HLA class I
molecules that share somewhat similar binding affinity for peptides bearing certain amino
acid motifs are grouped into HLA supertypes. The terms HLA superfamily, HLA
supertype family, HLA family, and HLA xx-like supertype molecules (where xx denotes
a particular HLA type), are synonyms.

The term "motif" refers to the pattern of residues in a peptide of defined
length, usually a peptide of from about 8 to about 13 amino acids for a class I HLA motif
and from about 6 to about 25 amino acids for a class II HLA motif, which is recognized
by a particular HLA molecule. Peptide motifs are typically different for each protein
encoded by each human HLA allele and differ in the pattern of the primary and secondary
anchor residues.

A "supermotif" is a peptide binding specificity shared by HLA molecules
encoded by two or more HLA alleles. Thus, a supermotif-bearing peptide preferably is
recognized with high or intermediate affinity (as defined herein) by two or more HLA
antigens.

"Cross-reactive binding" indicates that a peptide is bound by more than
one HLA molecule; a synonym is degenerate binding.

The term "peptide" is used interchangeably with "oligopeptide" in the
present specification to designate a series of residues, typically L-amino acids, connected
one to the other, typically by peptide bonds between the α-amino and carboxyl groups of
adjacent amino acids. The preferred CTL-inducing oligopeptides of the invention are 13
residues or less in length and usually consist of between about 8 and about 11 residues,
preferably 9 or 10 residues. The preferred HTL-inducing oligopeptides are less than
about 50 residues in length and usually consist of between about 6 and about 30 residues,
more usually between about 12 and 25, and often between about 15 and 20 residues.

An "immunogenic peptide" or "peptide-epitope" is a peptide which
comprises an allele-specific motif or supermotif such that the peptide will bind an HLA
molecule and induce a CTL and/or HTL response. Thus, immunogenic peptides of the
invention are capable of binding to an appropriate HLA molecule and thereafter inducing
a cytotoxic T cell response, or a helper T cell response, to the antigen from which the
immunogenic peptide is derived.

A "protective immune response" refers to a CTL and/or an HTL response
to an antigen derived from an infectious agent or a tumor antigen, which prevents or at
least partially arrests disease symptoms or progression. The immune response may also
include an antibody response which has been facilitated by the stimulation of helper T
cells.

The term "residue" refers to an amino acid or amino acid mimetic
incorporated into an oligopeptide by an amide bond or amide bond mimetic.

"Synthetic peptide" refers to a peptide that is not naturally occurring, but is
man-made using such methods as chemical synthesis or recombinant DNA technology.

The nomenclature used to describe peptide compounds follows the
conventional practice wherein the amino group is presented to the left (the N-terminus)
and the carboxyl group to the right (the C-terminus) of each amino acid residue. When
amino acid residue positions are referred to in a peptide epitope they are numbered in an
amino to carboxyl direction with position one being the position closest to the amino
terminal end of the epitope, or the peptide or protein of which it may be a part. In the
formulae representing selected specific embodiments of the present invention, the amino-
and carboxyl-terminal groups, although not specifically shown, are in the form they
would assume at physiologic pH values, unless otherwise specified. In the amino acid
structure formulae, each residue is generally represented by standard three letter or single
letter designations. The L-form of an amino acid residue is represented by a capital single
letter or a capital first letter of a three-letter symbol, and the D-form for those amino acids
having D-forms is represented by a lower case single letter or a lower case three letter
symbol. Glycine has no asymmetric carbon atom and is simply referred to as "Gly" or G.
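
The nomenclature convention above can be illustrated with a short conversion from three-letter to single-letter designations. The following Python sketch is not part of the original disclosure; the function name is hypothetical, and the code assumes the input is an unbroken run of standard three-letter symbols (unknown symbols raise a KeyError).

```python
import re

# Standard three-letter to single-letter amino acid designations.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_seq):
    """Convert a run of three-letter symbols to single-letter code.

    A lower-case symbol (D-form, per the convention above) maps to a
    lower-case letter. Glycine has no D-form and is always written "G".
    """
    out = []
    for sym in re.findall(r"[A-Za-z]{3}", three_letter_seq):
        letter = THREE_TO_ONE[sym.capitalize()]
        is_d_form = sym[0].islower() and letter != "G"
        out.append(letter.lower() if is_d_form else letter)
    return "".join(out)


# The PADRE example peptide (SEQ ID NO:38), read N-terminus to C-terminus.
print(to_one_letter("AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla"))  # AKFVAAWTLKAAA
```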

As used herein, the term "expression vector" is intended to refer to a
nucleic acid molecule capable of expressing an antigen of interest such as a MHC class I
or class II epitope in an appropriate target cell. An expression vector can be, for example,
a plasmid or virus, including DNA or RNA viruses. The expression vector contains a
promoter element to express an antigen of interest in the appropriate cell or tissue in
order to stimulate a desired immune response.

DETAILED DESCRIPTION OF THE INVENTION 
Cytotoxic T lymphocytes (CTLs) and helper T lymphocytes (HTLs) are
critical for immunity against infectious pathogens, such as viruses, bacteria, and protozoa;
tumor cells; autoimmune diseases; and the like. The present invention provides
minigenes that encode peptide epitopes which induce a CTL and/or HTL response. The
minigenes of the invention also include an MHC targeting sequence. A variety of
minigenes encoding different epitopes can be tested for immunogenicity using an HLA
transgenic mouse. The epitopes are typically a combination of two or more HTL
epitopes, or a CTL epitope plus a universal HTL epitope, and optionally include additional
HTL and/or CTL epitopes. Two, three, four, five, six, seven, eight, nine, ten, twenty,
thirty, forty or about fifty different epitopes, either HTL and/or CTL, can be included in
the minigene, along with the MHC targeting sequence. The epitopes can have different
HLA restriction. Epitopes to be tested include those derived from viruses such as HIV,
HBV, HCV, HSV, CMV, HPV, and HTLV; cancer antigens such as p53, Her2/Neu,
MAGE, PSA, human papilloma virus, and CEA; parasites such as Trypanosoma,
Plasmodium, Leishmania, Giardia, and Entamoeba; autoimmune diseases such as rheumatoid
arthritis, myasthenia gravis, and lupus erythematosus; fungi such as Aspergillus and
Candida; and bacteria such as Escherichia coli, Staphylococci, Chlamydia, Mycobacteria,
Streptococci, and Pseudomonas. The epitopes to be encoded by the minigene are selected
and tested using the methods described in published PCT applications WO 93/07421, WO
94/02353, WO 95/01000, WO 97/04451, and WO 97/05348, herein incorporated by
reference.

HTL and CTL Epitopes

The expression vectors of the invention encode one or more MHC class II
and/or class I epitopes and an MHC targeting sequence. Multiple MHC class II or class I
epitopes present in an expression vector can be derived from the same antigen, or the
MHC epitopes can be derived from different antigens. For example, an expression vector
can contain one or more MHC epitopes that can be derived from two different antigens of
the same virus or from two different antigens of different viruses. Furthermore, any
MHC epitope can be used in the expression vectors of the invention. For example, any
single MHC epitope or a combination of the MHC epitopes shown in Tables 1 to 8 can be
used in the expression vectors of the invention. Other peptide epitopes can be selected by
one of skill in the art, e.g., by using a computer to select epitopes that contain HLA allele-
specific motifs or supermotifs. The expression vectors of the invention can also encode
one or more universal MHC class II epitopes, e.g., PADRE (see, e.g., SEQ ID NO:38 and
Table 8 of TTC docket No. 018623-006221, filed May 12, 1999, USSN ).

Universal MHC class II epitopes can be advantageously combined with
other MHC class I and class II epitopes to increase the number of cells that are activated
in response to a given antigen and provide broader population coverage of MHC-reactive
alleles. Thus, the expression vectors of the invention can encode MHC epitopes specific
for an antigen, universal MHC class II epitopes, or a combination of specific MHC
epitopes and at least one universal MHC class II epitope.

MHC class I epitopes are generally about 5 to 15 amino acids in length, in
particular about 8 to 11 amino acids in length. MHC class II epitopes are generally about
10 to 25 amino acids in length, in particular about 13 to 21 amino acids in length. A
MHC class I or II epitope can be derived from any desired antigen of interest. The
antigen of interest can be a viral antigen, surface receptor, tumor antigen, oncogene,
enzyme, or any pathogen, cell or molecule for which an immune response is desired.
Epitopes can be selected based on their ability to bind one or multiple HLA alleles, and
can also be selected using the "analog" technique described below.
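
The length ranges given above lend themselves to a simple screening check on candidate epitopes. The following Python sketch is illustrative only and is not part of the original disclosure. It treats the stated "about" ranges as hard integer bounds, which is a simplifying assumption, and the function and constant names are hypothetical.

```python
# Length ranges in residues, taken from the paragraph above.
GENERAL_LENGTH = {"I": (5, 15), "II": (10, 25)}
PARTICULAR_LENGTH = {"I": (8, 11), "II": (13, 21)}


def length_status(peptide, mhc_class):
    """Return "particular", "general", or "outside" for a peptide and MHC class."""
    n = len(peptide)
    low, high = PARTICULAR_LENGTH[mhc_class]
    if low <= n <= high:
        return "particular"
    low, high = GENERAL_LENGTH[mhc_class]
    if low <= n <= high:
        return "general"
    return "outside"


padre = "AKFVAAWTLKAAA"  # 13 residues (SEQ ID NO:38)
print(length_status(padre, "II"))   # particular
print(length_status(padre, "I"))    # general
print(length_status("AKFV", "I"))   # outside
```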



Targeting Sequences 

The expression vectors of the invention encode one or more MHC epitopes 
operably linked to a MHC targeting sequence. The use of a MHC targeting sequence 
enhances the immune response to an antigen, relative to delivery of antigen alone, by
directing the peptide epitope to the site of MHC molecule assembly and transport to the
cell surface, thereby providing an increased number of MHC molecule-peptide epitope 
complexes available for binding to and activation of T cells. 

MHC class I targeting sequences are used in the present invention, e.g.,
those sequences that target an MHC class I epitope peptide to a cytosolic pathway or to
the endoplasmic reticulum (see, e.g., Rammensee et al., Immunogenetics 41:178-228
(1995)). For example, the cytosolic pathway processes endogenous antigens that are
expressed inside the cell. Although not wishing to be bound by any particular theory,
cytosolic proteins are thought to be at least partially degraded by an endopeptidase
activity of a proteasome and then transported to the endoplasmic reticulum by the TAP
molecule (transporter associated with processing). In the endoplasmic reticulum, the
antigen binds to MHC class I molecules. Endoplasmic reticulum signal sequences bypass
the cytosolic processing pathway and directly target endogenous antigens to the
endoplasmic reticulum, where proteolytic degradation into peptide fragments occurs.
Such MHC class I targeting sequences are well known in the art, and include, e.g., signal
sequences such as those from Ig kappa, tissue plasminogen activator, or insulin. A
preferred signal peptide is the human Ig kappa chain sequence. Endoplasmic reticulum
signal sequences can also be used to target MHC class II epitopes to the endoplasmic
reticulum, the site of MHC class I molecule assembly.

MHC class II targeting sequences are also used in the invention, e.g., those
that target a peptide to the endocytic pathway. These targeting sequences typically direct
extracellular antigens to enter the endocytic pathway, which results in the antigen being
transferred to the lysosomal compartment where the antigen is proteolytically cleaved
into antigen peptides for binding to MHC class II molecules. As with the normal
processing of exogenous antigen, a sequence that directs a MHC class II epitope to the
endosomes of the endocytic pathway and/or subsequently to lysosomes, where the MHC
class II epitope can bind to a MHC class II molecule, is a MHC class II targeting
sequence. For example, one group of MHC class II targeting sequences useful in the
invention is the lysosomal targeting sequences, which localize polypeptides to lysosomes.
Since MHC class II molecules typically bind to antigen peptides derived from proteolytic
processing of endocytosed antigens in lysosomes, a lysosomal targeting sequence can
function as a MHC class II targeting sequence. Lysosomal targeting sequences are well
known in the art and include sequences found in the lysosomal proteins LAMP-1 and
LAMP-2 as described by August et al. (U.S. Patent No. 5,633,234, issued May 27, 1997),
which is incorporated herein by reference.

Other lysosomal proteins that contain lysosomal targeting sequences 
include HLA-DM. HLA-DM is an endosomal/lysosomal protein that functions in 
facilitating binding of antigen peptides to MHC class II molecules. Since it is located in 
the lysosome, HLA-DM has a lysosomal targeting sequence that can function as a MHC
class II molecule targeting sequence (Copier et al. J. Immunol. 157:1017-1027 (1996), 
which is incorporated herein by reference). 

The resident lysosomal protein HLA-DO can also function as a lysosomal
targeting sequence. In contrast to the above described resident lysosomal proteins
LAMP-1 and HLA-DM, which encode specific Tyr-containing motifs that target proteins
to lysosomes, HLA-DO is targeted to lysosomes by association with HLA-DM (Liljedahl
et al., EMBO J. 15:4817-4824 (1996), which is incorporated herein by reference).
Therefore, the sequences of HLA-DO that cause association with HLA-DM and,
consequently, translocation of HLA-DO to lysosomes can be used as MHC class II
targeting sequences. Similarly, the murine homolog of HLA-DO, H2-DO, can be used to
derive a MHC class II targeting sequence. A MHC class II epitope can be fused to HLA-
DO or H2-DO and targeted to lysosomes.

In another example, the cytoplasmic domains of B cell receptor subunits
Ig-α and Ig-β mediate antigen internalization and increase the efficiency of antigen
presentation (Bonnerot et al., Immunity 3:335-347 (1995), which is incorporated herein
by reference). Therefore, the cytoplasmic domains of the Ig-α and Ig-β proteins can
function as MHC class II targeting sequences that target a MHC class II epitope to the
endocytic pathway for processing and binding to MHC class II molecules.

Another example of a MHC class II targeting sequence that directs MHC
class II epitopes to the endocytic pathway is a sequence that directs polypeptides to be
secreted, where the polypeptide can enter the endosomal pathway. These MHC class II
targeting sequences that direct polypeptides to be secreted mimic the normal pathway by
which exogenous, extracellular antigens are processed into peptides that bind to MHC
class II molecules. Any signal sequence that functions to direct a polypeptide through the
endoplasmic reticulum and ultimately to be secreted can function as a MHC class II
targeting sequence so long as the secreted polypeptide can enter the endosomal/lysosomal
pathway and be cleaved into peptides that can bind to MHC class II molecules. An
example of such a fusion is shown in Figure 11, where the signal sequence of kappa
immunoglobulin is fused to multiple MHC class II epitopes.

In another example, the Ii protein binds to MHC class II molecules in the
endoplasmic reticulum, where it functions to prevent peptides present in the endoplasmic
reticulum from binding to the MHC class II molecules. Therefore, fusion of a MHC class
II epitope to the Ii protein targets the MHC class II epitope to the endoplasmic reticulum
and a MHC class II molecule. For example, the CLIP sequence of the Ii protein can be
removed and replaced with a MHC class II epitope sequence so that the MHC class II
epitope is directed to the endoplasmic reticulum, where the epitope binds to a MHC class
II molecule.

In some cases, antigens themselves can serve as MHC class II or I
targeting sequences and can be fused to a universal MHC class II epitope to stimulate an
immune response. Although cytoplasmic viral antigens are generally processed and
presented as complexes with MHC class I molecules, long-lived cytoplasmic proteins
such as the influenza matrix protein can enter the MHC class II molecule processing
pathway (Gueguen & Long, Proc. Natl. Acad. Sci. USA 93:14692-14697 (1996), which
is incorporated herein by reference). Therefore, long-lived cytoplasmic proteins can
function as a MHC class II targeting sequence. For example, an expression vector
encoding influenza matrix protein fused to a universal MHC class II epitope can be
advantageously used to target influenza antigen and the universal MHC class II epitope to
the MHC class II pathway for stimulating an immune response to influenza.

Other examples of antigens functioning as MHC class II targeting
sequences include polypeptides that spontaneously form particles. The polypeptides are
secreted from the cell that produces them and spontaneously form particles, which are
taken up into an antigen-presenting cell by endocytosis such as receptor-mediated
endocytosis or are engulfed by phagocytosis. The particles are proteolytically cleaved
into antigen peptides after entering the endosomal/lysosomal pathway.

One such polypeptide that spontaneously forms particles is HBV surface
antigen (HBV-S) (Diminsky et al., Vaccine 15:637-647 (1997); Le Borgne et al.,
Virology 240:304-315 (1998), each of which is incorporated herein by reference).
Another polypeptide that spontaneously forms particles is HBV core antigen (Kuhrober et
al., International Immunol. 9:1203-1212 (1997), which is incorporated herein by
reference). Still another polypeptide that spontaneously forms particles is the yeast Ty
protein (Weber et al., Vaccine 13:831-834 (1995), which is incorporated herein by
reference). For example, an expression vector containing HBV-S antigen fused to a
universal MHC class II epitope can be advantageously used to target HBV-S antigen and
the universal MHC class II epitope to the MHC class II pathway for stimulating an
immune response to HBV.

Binding Affinity of Peptide Epitopes for HLA Molecules

The large degree of HLA polymorphism is an important factor to be taken
into account with the epitope-based approach to vaccine development. To address this
factor, epitope selection encompassing identification of peptides capable of binding at
high or intermediate affinity to multiple HLA molecules is preferably utilized; most
preferably these epitopes bind at high or intermediate affinity to two or more allele-
specific HLA molecules.

CTL-inducing peptides of interest for vaccine compositions preferably
include those that have a binding affinity for class I HLA molecules of less than 500 nM.
HTL-inducing peptides preferably include those that have a binding affinity for class II
HLA molecules of less than 1000 nM. For example, peptide binding is assessed by
testing the capacity of a candidate peptide to bind to a purified HLA molecule in vitro.
Peptides exhibiting high or intermediate affinity are then considered for further analysis.
Selected peptides are tested on other members of the supertype family. In preferred
embodiments, peptides that exhibit cross-reactive binding are then used in vaccines or in
cellular screening analyses.

Higher HLA binding affinity is typically correlated with greater
immunogenicity. Greater immunogenicity can be manifested in several different ways.
Immunogenicity corresponds to whether an immune response is elicited at all, and to the
vigor of any particular response, as well as to the extent of a population in which a
response is elicited. For example, a peptide might elicit an immune response in a diverse
array of the population, yet in no instance produce a vigorous response. In accordance
with these principles, close to 90% of high binding peptides have been found to be
immunogenic, as contrasted with about 50% of the peptides which bind with intermediate
affinity. Moreover, higher binding affinity peptides lead to more vigorous immunogenic
responses. As a result, less peptide is required to elicit a similar biological effect if a high
affinity binding peptide is used. Thus, in preferred embodiments of the invention, high
binding epitopes are particularly useful.




The relationship between binding affinity for HLA class I molecules and
immunogenicity of discrete peptide epitopes on bound antigens has been determined for
the first time in the art by the present inventors. The correlation between binding affinity
and immunogenicity was analyzed in two different experimental approaches (Sette et al.,
J. Immunol. 153:5586-5592 (1994)). In the first approach, the immunogenicity of
potential epitopes ranging in HLA binding affinity over a 10,000-fold range was analyzed
in HLA-A*0201 transgenic mice. In the second approach, the antigenicity of
approximately 100 different hepatitis B virus (HBV)-derived potential epitopes, all
carrying A*0201 binding motifs, was assessed by using PBL (peripheral blood
lymphocytes) from acute hepatitis patients. Pursuant to these approaches, it was
determined that an affinity threshold of approximately 500 nM (preferably 50 nM or less)
determines the capacity of a peptide epitope to elicit a CTL response. These data are true
for class I binding affinity measurements for naturally processed peptides and for
synthesized T cell epitopes. These data also indicate the important role of determinant
selection in the shaping of T cell responses (see, e.g., Schaeffer et al., Proc. Natl. Acad.
Sci. USA 86:4649-4653 (1989)).



An affinity threshold associated with immunogenicity in the context of
HLA class II DR molecules has also been delineated (see, e.g., Southwood et al., J.
Immunology 160:3363-3373 (1998), and USSN 60/087192, filed 5/29/98). In order to
define a biologically significant threshold of DR binding affinity, a database of the
binding affinities of 32 DR-restricted epitopes for their restricting element (i.e., the HLA
molecule that binds the motif) was compiled. In approximately half of the cases (15 of 32
epitopes), DR restriction was associated with high binding affinities, i.e., binding affinities
of less than 100 nM. In the other half of the cases (16 of 32), DR restriction was
associated with intermediate affinity (binding affinities in the 100-1000 nM range). In
only one of 32 cases was DR restriction associated with an IC50 of 1000 nM or greater.
Thus, 1000 nM can be defined as an affinity threshold associated with immunogenicity in
the context of DR molecules.

Peptide Epitope Binding Motifs and Supermotifs

In the past few years evidence has accumulated to demonstrate that a large
fraction of HLA class I and class II molecules can be classified into a relatively few
supertypes, each characterized by largely overlapping peptide binding repertoires, and
consensus structures of the main peptide binding pockets.

For HLA molecule pocket analyses, the residues comprising the B and F
pockets of HLA class I molecules as described in crystallographic studies were analyzed
(Guo et al., Nature 360:364 (1992); Saper et al., J. Mol. Biol. 219:277 (1991); Madden et
al., Cell 75:693 (1993); Parham et al., Immunol. Rev. 143:141 (1995)). In these analyses,
residues 9, 45, 63, 66, 67, 70, and 99 were considered to make up the B pocket; and the B
pocket was deemed to determine the specificity for the amino acid residue in the second
position of peptide ligands. Similarly, residues 77, 80, 81, and 116 were considered to
determine the specificity of the F pocket; the F pocket was deemed to determine the
specificity for the C-terminal residue of a peptide ligand bound by the HLA class I
molecule.

Through the study of single amino acid substituted antigen analogs and the
sequencing of endogenously bound, naturally processed peptides, critical residues
required for allele-specific binding to HLA molecules have been identified. The presence
of these residues correlates with binding affinity for HLA molecules. The identification
of motifs and/or supermotifs that correlate with high and intermediate affinity binding is
an important issue with respect to the identification of immunogenic peptide epitopes for
the inclusion in a vaccine. Kast et al. (J. Immunol. 152:3904-3912 (1994)) have shown
that motif-bearing peptides account for 90% of the epitopes that bind to allele-specific
HLA class I molecules. In this study all possible peptides of 9 amino acids in length and
overlapping by eight amino acids (240 peptides), which cover the entire sequence of the
E6 and E7 proteins of human papillomavirus type 16, were evaluated for binding to five
allele-specific HLA molecules that are expressed at high frequency among different
ethnic groups. This unbiased set of peptides allowed an evaluation of the predictive value
of HLA class I motifs. From the set of 240 peptides, 22 peptides were identified that
bound to an allele-specific HLA molecule with high or intermediate affinity. Of these
22 peptides, 20 (i.e., 91%) were motif-bearing. Thus, this study demonstrates the value
of motifs for the identification of peptide epitopes for inclusion in a vaccine: application
of motif-based identification techniques eliminates screening of 90% of the potential
epitopes in a target antigen protein sequence.
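
The screening strategy described above, in which every 9-residue peptide overlapping its neighbor by eight residues is generated and then filtered by motif, can be sketched as follows. This Python example is illustrative only and is not part of the original disclosure. The anchor residue sets and the test sequence are hypothetical placeholders and are not motifs or sequences taken from this specification; the function names are likewise assumptions. Position 2 and the C-terminal residue are used as anchor positions because they are the positions associated above with the B and F pockets.

```python
def overlapping_peptides(protein_seq, length=9):
    """All peptides of `length` residues, each offset from the last by one residue.

    A protein of n residues yields n - length + 1 peptides (none if n < length).
    """
    return [protein_seq[i:i + length]
            for i in range(len(protein_seq) - length + 1)]


def bears_motif(peptide, pos2_anchors, c_term_anchors):
    """True if position 2 and the C-terminal residue are both allowed anchors."""
    return peptide[1] in pos2_anchors and peptide[-1] in c_term_anchors


# Hypothetical anchor sets and sequence, for illustration only.
POS2_ANCHORS = {"L", "M"}
C_TERM_ANCHORS = {"V", "L", "I"}
sequence = "ALKDEFGHIVMLQRSTWYV"  # 19 residues

peptides = overlapping_peptides(sequence)
hits = [p for p in peptides if bears_motif(p, POS2_ANCHORS, C_TERM_ANCHORS)]

print(len(peptides))  # 11
print(hits)           # ['ALKDEFGHI', 'MLQRSTWYV']
```

Only the motif-bearing subset would then proceed to binding assays, which is the reduction in screening effort the paragraph above describes.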

Peptides of the present invention may also include epitopes that bind to
MHC class II DR molecules. There is a significant difference between class I and class II
HLA molecules. This difference corresponds to the fact that, although a stringent size
restriction and motif position relative to the binding pocket exists for peptides that bind to
class I molecules, a greater degree of heterogeneity in both size and binding frame
position of the motif, relative to the N and C termini of the peptide, exists for class II
peptide ligands.

This increased heterogeneity of HLA class II peptide ligands is due to the
structure of the binding groove of the HLA class II molecule which, unlike its class I
counterpart, is open at both ends. Crystallographic analysis of HLA class II DRB*0101-
peptide complexes showed that the residues occupying position 1 and position 6 of
peptides complexed with DRB*0101 engage two complementary pockets on the
DRB*0101 molecules, with the P1 position corresponding to the most crucial anchor
residue and the deepest hydrophobic pocket (see, e.g., Madden, Ann. Rev. Immunol.
13:587 (1995)). Other studies have also pointed to the P6 position as a crucial anchor
residue for binding to various other DR molecules.

Thus, peptides of the present invention are identified by any one of several
HLA class I or II-specific amino acid motifs (see, e.g., Tables I-III of USSN 09/226,775,
and 09/239,043, herein incorporated by reference in their entirety). If the presence of the
motif corresponds to the ability to bind several allele-specific HLA antigens it is referred
to as a supermotif. The allele-specific HLA molecules that bind to peptides that possess a
particular amino acid supermotif are collectively referred to as an HLA "supertype."

Immune Response-Stimulating Peptide Analogs 

In general, CTL and HTL responses are not directed against all possible 
epitopes. Rather, they are restricted to a few "immunodominant" deteraiinants 
(Zinkemagel et al, Adv. Immunol. 11:5 1 59 (1979); Bennink et al. J. Exp. Med. 
168:1935-1939 (1988); Rawle et al. J. Immunol. 146:3977-3984 (1991)). It has been 
recognized that immunodominance (Benacerraf al, Science 175:273-279 (1972)) could 
be explained by either the ability of a given epitope to selectively bind a particular HLA 
protein (determinant selection theory) (Vitiello et al, J. Immunol. 131:1635 (1983)); 
Rosenthal et al. Nature 267:156-158 (1977)), or being selectively recognized by the 
existing TCR (T cell receptor) specificity (repertoire theory) (Klein, Immunology. The 
Science of Self on self Discrimination, pp. 270-3 10 (1982)). It has been demonstrated that 
additional factors, mostly linked to processing events, can also play a key role in 
dictating, beyond strict immunogenicity, which of the many potential determinants will 
be presented as immunodominant (Sercarz et al, Annu. Rev. Immunol 1 1 :729-766 
(1993)). 
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The concept of dominance and subdominance is relevant to 
immunotherapy of both infectious diseases and cancer. For example, in the course of 
chronic viral disease, recruitment of subdominant epitopes can be important for 
successful clearance of the infection, especially if dominant CTL or HTL specificities 
have been inactivated by functional tolerance, suppression, mutation of viruses and other 
mechanisms (Franco et al., Curr. Opin. Immunol. 7:524-531 (1995)). In the case of 
cancer and tumor antigens, CTLs recognizing at least some of the highest binding affinity 
peptides might be functionally inactivated. Lower binding affinity peptides are 
preferentially recognized at these times, and may therefore be preferred in therapeutic or 
prophylactic anti-cancer vaccines. 

In particular, it has been noted that a significant number of epitopes 
derived from known non-viral tumor associated antigens (TAA) bind HLA class I with 
intermediate affinity (IC50 in the 50-500 nM range). For example, it has been found that 
8 of 15 known TAA peptides recognized by tumor infiltrating lymphocytes (TIL) or CTL 
bound in the 50-500 nM range. These data are in contrast with estimates that 90% of 
known viral antigens were bound by HLA class I molecules with IC50 of 50 nM or less, 
while only approximately 10% bound in the 50-500 nM range (Sette et al., J. Immunol. 
153:5586-5592 (1994)). In the cancer setting this phenomenon is probably due to 
elimination, or functional inhibition of the CTL recognizing several of the highest binding 
peptides, presumably because of T cell tolerization events. 
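
As a rough illustration of the affinity ranges discussed above, the following Python sketch bins a peptide by its measured IC50. The 50 nM and 500 nM thresholds are taken from the text; the function name, the "low" label for weaker binders, and the example peptides with their IC50 values are hypothetical placeholders, not data from this specification.

```python
def affinity_class(ic50_nm: float) -> str:
    """Bin an HLA class I binding measurement (IC50 in nM).

    Thresholds follow the ranges quoted in the text: 50 nM or less
    is treated as high affinity, 50-500 nM as intermediate affinity.
    The label "low" for anything weaker is an assumption.
    """
    if ic50_nm <= 0:
        raise ValueError("IC50 must be positive")
    if ic50_nm <= 50:
        return "high"
    if ic50_nm <= 500:
        return "intermediate"
    return "low"


# Hypothetical measurements, for illustration only.
example_ic50 = {"peptide_A": 12.0, "peptide_B": 180.0, "peptide_C": 2400.0}
classes = {name: affinity_class(value) for name, value in example_ic50.items()}

# The text reports 8 of 15 known TAA peptides in the intermediate range.
taa_intermediate_fraction = 8 / 15
```

Under these assumptions, a candidate list could be filtered for intermediate binders, which the text suggests may be preferred in the cancer setting.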

Without intending to be bound by theory, it is believed that because T cells 
to dominant epitopes may have been clonally deleted, selecting subdominant epitopes 
may allow extant T cells to be recruited, which will then lead to a therapeutic or 
prophylactic response. However, the binding of HLA molecules to subdominant epitopes 
is often less vigorous than to dominant ones. Accordingly, there is a need to be able to 
modulate the binding affinity of particular immunogenic epitopes for one or more HLA 
molecules, and thereby to modulate the immune response elicited by the peptide, for 
example to prepare analog peptides which elicit a more vigorous response. This ability 
would greatly enhance the usefulness of peptide-based vaccines and therapeutic agents. 

Thus, although peptides with suitable cross-reactivity among all alleles of 
a superfamily are identified by the screening procedures described above, cross-reactivity 
is not always as complete as possible, and in certain cases procedures to further increase 
cross-reactivity of peptides can be useful; moreover, such procedures can also be used to 
modify other properties of the peptides such as binding affinity or peptide stability. 
Having established the general rules that govern cross-reactivity of peptides for HLA 
alleles within a given motif or supermotif, modification (i.e., analoging) of the structure 
of peptides of particular interest in order to achieve broader (or otherwise modified) HLA 
binding capacity can be performed. More specifically, peptides which exhibit the 
broadest cross-reactivity patterns can be produced in accordance with the teachings 
herein. The present concepts related to analog generation are set forth in greater detail in 
co-pending USSN 09/226,775. 

In brief, the strategy employed utilizes the motifs or supermotifs which 
correlate with binding to certain HLA class I and II molecules. The motifs or supermotifs 
are defined by having primary anchors, and in many cases secondary anchors (see Tables 
I-III of USSN 09/226,775). Analog peptides can be created by substituting amino acid 
residues at primary anchor, secondary anchor, or at primary and secondary anchor 
positions. Generally, analogs are made for peptides that already bear a motif or 
supermotif. Preferred secondary anchor residues of supermotifs and motifs that have 
been defined for HLA class I and class II binding peptides are shown in Tables II and III, 
respectively, of USSN 09/226,775. 

For a number of the motifs or supermotifs in accordance with the 
invention, residues are defined which are deleterious to binding to allele-specific HLA 
molecules or members of HLA supertypes that bind to the respective motif or supermotif 
(see Tables II and III of USSN 09/226,775). Accordingly, removal of such residues that 
are detrimental to binding can be performed in accordance with the methods described 
therein. For example, in the case of the A3 supertype, when all peptides that have such 
deleterious residues are removed from the population of analyzed peptides, the incidence 
of cross-reactivity increases from 22% to 37% (Sidney et al., Hu. Immunol. 45:79 
(1996)). Thus, one strategy to improve the cross-reactivity of peptides within a given 
supermotif is simply to delete one or more of the deleterious residues present within a 
peptide and substitute a small "neutral" residue such as Ala (that may not influence T cell 
recognition of the peptide). An enhanced likelihood of cross-reactivity is expected if, 
together with elimination of detrimental residues within a peptide, "preferred" residues 
associated with high affinity binding to an allele-specific HLA molecule or to multiple 
HLA molecules within a superfamily are inserted. 
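
The analoging strategy described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the `Motif` class, the `make_analog` function, the anchor positions, the residue sets and the example peptide are all hypothetical and are not taken from the tables of USSN 09/226,775.

```python
from dataclasses import dataclass, field


@dataclass
class Motif:
    """Hypothetical motif description; positions are 1-based."""
    # position -> preferred residue to place at that anchor position
    preferred_anchors: dict
    # position -> set of residues considered deleterious at that position
    deleterious: dict = field(default_factory=dict)


def make_analog(peptide: str, motif: Motif, neutral: str = "A") -> str:
    """Return an analog of `peptide` built in two steps.

    1. Any deleterious residue is replaced by a small neutral residue
       (Ala by default).
    2. Each listed anchor position is set to its preferred residue,
       whatever residue was there before.
    """
    residues = list(peptide)
    for pos, bad in motif.deleterious.items():
        if pos <= len(residues) and residues[pos - 1] in bad:
            residues[pos - 1] = neutral
    for pos, good in motif.preferred_anchors.items():
        if pos <= len(residues):
            residues[pos - 1] = good
    return "".join(residues)


# Toy motif and peptide, for illustration only.
toy_motif = Motif(
    preferred_anchors={2: "L", 9: "V"},
    deleterious={3: {"D", "E"}, 7: {"P"}},
)
analog = make_analog("KMDGSYPNT", toy_motif)
```

Any analog produced this way would still need to be tested for binding and for recognition of the native epitope, as the following paragraphs describe.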

To ensure that an analog peptide, when used as a vaccine, actually elicits a 
CTL response to the native epitope in vivo (or, in the case of class II epitopes, a failure to 
elicit helper T cells that cross-react with the wild type peptides), the analog peptide may 
be used to immunize T cells in vitro from individuals of the appropriate HLA allele. 
Thereafter, the immunized cells' capacity to induce lysis of wild type peptide sensitized 
target cells is evaluated. In both class I and class II systems it will be desirable to use as 
targets, cells that have been either infected or transfected with the appropriate genes to 
establish whether endogenously produced antigen is also recognized by the relevant T 
cells. 

Another embodiment of the invention is to create analogs of weak binding 
peptides, to thereby ensure adequate numbers of cross-reactive cellular binders. Class I 
peptides exhibiting binding affinities of 500-50000 nM, and carrying an acceptable but 
suboptimal primary anchor residue at one or both positions can be "fixed" by substituting 
preferred anchor residues in accordance with the respective supertype. The analog 
peptides can then be tested for crossbinding activity. 

Another embodiment for generating effective peptide analogs involves the 
substitution of residues that have an adverse impact on peptide stability or solubility in, 
e.g., a liquid environment. This substitution may occur at any position of the peptide 
epitope. For example, a cysteine (C) can be substituted out in favor of gamma-amino 
butyric acid. Due to its chemical nature, cysteine has the propensity to form disulfide 
bridges and sufficiently alter the peptide structurally so as to reduce binding capacity. 
Substituting gamma-amino butyric acid for C not only alleviates this problem, but 
actually improves binding and crossbinding capability in certain instances (Sette et al., In: 
Persistent Viral Infections (Ahmed & Chen, eds., 1998)). Substitution of cysteine with 
gamma-amino butyric acid may occur at any residue of a peptide epitope, i.e., at either 
anchor or non-anchor positions. 

Expression Vectors and Construction of a Minigene 

The expression vectors of the invention contain at least one promoter 
element that is capable of expressing a transcription unit encoding the antigen of interest, 
for example, a MHC class I epitope or a MHC class II epitope and an MHC targeting 
sequence in the appropriate cells of an organism so that the antigen is expressed and 
targeted to the appropriate MHC molecule. For example, if the expression vector is 
administered to a mammal such as a human, a promoter element that functions in a 
human cell is incorporated into the expression vector. An example of an expression 
vector useful for expressing the MHC class II epitopes fused to MHC class II targeting 
sequences and the MHC class I epitopes described herein is the pEP2 vector described in 
Example IV. 

This invention relies on routine techniques in the field of recombinant 
genetics. Basic texts disclosing the general methods of use in this invention include 
Sambrook et al., Molecular Cloning: A Laboratory Manual (2nd ed. 1989); Kriegler, 
Gene Transfer and Expression: A Laboratory Manual (1990); Current Protocols in 
Molecular Biology (Ausubel et al., eds., 1994); Oligonucleotide Synthesis: A Practical 
Approach (Gait, ed., 1984); Kuijpers, Nucleic Acids Research 18(17):5197 (1994); 
Dueholm, J. Org. Chem. 59:5767-5773 (1994); Methods in Molecular Biology, volume 
20 (Agrawal, ed.); and Tijssen, Laboratory Techniques in Biochemistry and Molecular 
Biology-Hybridization with Nucleic Acid Probes, e.g., Part I, chapter 2 "Overview of 
principles of hybridization and the strategy of nucleic acid probe assays" (1993). 

The minigenes are comprised of two or many different epitopes (see, e.g., 
Tables 1-8). The nucleic acids encoding the epitopes are assembled in a minigene 
according to standard techniques. In general, the nucleic acid sequences encoding 
minigene epitopes are isolated using amplification techniques with oligonucleotide 
primers, or are chemically synthesized. Recombinant cloning techniques can also be used 
when appropriate. Oligonucleotide sequences are selected which either amplify (when 
using PCR to assemble the minigene) or encode (when using synthetic oligonucleotides to 
assemble the minigene) the desired epitopes. 

Amplification techniques using primers are typically used to amplify and 
isolate sequences encoding the epitopes of choice from DNA or RNA (see U.S. Patents 
4,683,195 and 4,683,202; PCR Protocols: A Guide to Methods and Applications (Innis et 
al., eds, 1990)). Methods such as polymerase chain reaction (PCR) and ligase chain 
reaction (LCR) can be used to amplify epitope nucleic acid sequences directly from 
mRNA, from cDNA, from genomic libraries or cDNA libraries. Restriction endonuclease 
sites can be incorporated into the primers. Minigenes amplified by the PCR reaction can 
be purified from agarose gels and cloned into an appropriate vector. 

Synthetic oligonucleotides can also be used to construct minigenes. This 
method is performed using a series of overlapping oligonucleotides, representing both the 
sense and non-sense strands of the gene. These DNA fragments are then annealed, 
ligated and cloned. Oligonucleotides that are not commercially available can be 
chemically synthesized according to the solid phase phosphoramidite triester method first 
described by Beaucage & Caruthers, Tetrahedron Letts. 22:1859-1862 (1981), using an 
automated synthesizer, as described in Van Devanter et al., Nucleic Acids Res. 12:6159- 
6168 (1984). Purification of oligonucleotides is by either native acrylamide gel 
electrophoresis or by anion-exchange HPLC as described in Pearson & Reanier, J. 
Chrom. 255:137-149 (1983). 

The epitopes of the minigene are typically subcloned into an expression 
vector that contains a strong promoter to direct transcription, as well as other regulatory 
sequences such as enhancers and polyadenylation sites. Suitable promoters are well 
known in the art and described, e.g., in Sambrook et al. and Ausubel et al. Eukaryotic 
expression systems for mammalian cells are well known in the art and are commercially 
available. Such promoter elements include, for example, cytomegalovirus (CMV), Rous 
sarcoma virus LTR and SV40. 

The expression vector typically contains a transcription unit or expression 
cassette that contains all the additional elements required for the expression of the 
minigene in host cells. A typical expression cassette thus contains a promoter operably 
linked to the minigene and signals required for efficient polyadenylation of the transcript. 
Additional elements of the cassette may include enhancers and introns with functional 
splice donor and acceptor sites. 

In addition to a promoter sequence, the expression cassette can also 
contain a transcription termination region downstream of the structural gene to provide 
for efficient termination. The termination region may be obtained from the same gene as 
the promoter sequence or may be obtained from different genes. 

The particular expression vector used to transport the genetic information 
into the cell is not particularly critical. Any of the conventional vectors used for 
expression in eukaryotic cells may be used. Expression vectors containing regulatory 
elements from eukaryotic viruses are typically used in eukaryotic expression vectors, e.g., 
SV40 vectors, papilloma virus vectors, and vectors derived from Epstein-Barr virus. Other 
exemplary eukaryotic vectors include pMSG, pAV009/A+, pMTO10/A+, pMAMneo-5, 
baculovirus pDSVE, and any other vector allowing expression of proteins under the 
direction of the SV40 early promoter, SV40 late promoter, metallothionein promoter, 
murine mammary tumor virus promoter, Rous sarcoma virus promoter, polyhedrin 
promoter, or other promoters shown effective for expression in eukaryotic cells. In one 
embodiment, the vector pEP2 is used in the present invention. 

Other elements that are typically included in expression vectors also 
include a replicon that functions in E. coli, a gene encoding antibiotic resistance to permit 
selection of bacteria that harbor recombinant plasmids, and unique restriction sites in 
nonessential regions of the plasmid to allow insertion of eukaryotic sequences. The 
particular antibiotic resistance gene chosen is not critical; any of the many resistance 
genes known in the art are suitable. The prokaryotic sequences are preferably chosen 
such that they do not interfere with the replication of the DNA in eukaryotic cells, if 
necessary. 

Administration In Vivo 

The invention also provides methods for stimulating an immune response 
by administering an expression vector of the invention to an individual. Administration 
of an expression vector of the invention for stimulating an immune response is 
advantageous because the expression vectors of the invention target MHC epitopes to 
MHC molecules, thus increasing the number of CTL and HTL activated by the antigens 
encoded by the expression vector. 

Initially, the expression vectors of the invention are screened in mouse to 
determine the expression vectors having optimal activity in stimulating a desired immune 
response. Initial studies are therefore carried out, where possible, with mouse genes of 
the MHC targeting sequences. Methods of determining the activity of the expression 
vectors of the invention are well known in the art and include, for example, the uptake of 
³H-thymidine to measure T cell activation and the release of ⁵¹Cr to measure CTL activity 
as described below in Examples II and III. Experiments similar to those described in 
Example IV are performed to determine the expression vectors having activity at 
stimulating an immune response. The expression vectors having activity are further 
tested in human. To circumvent potential adverse immunological responses to encoded 
mouse sequences, the expression vectors having activity are modified so that the MHC 
class II targeting sequences are derived from human genes. For example, the analogous 
regions of the human homologs of genes containing various MHC class II targeting 
sequences are substituted into the expression vectors of the invention. 
Examples of such human homologs of genes containing MHC class II targeting sequences 
are shown in Figures 12 to 17. Expression vectors containing human MHC class II 
targeting sequences, such as those described in Example I below, are tested for activity at 
stimulating an immune response in human. 

The invention also relates to pharmaceutical compositions comprising a 
pharmaceutically acceptable carrier and an expression vector of the invention. 
Pharmaceutically acceptable carriers are well known in the art and include aqueous or 
non-aqueous solutions, suspensions and emulsions, including physiologically buffered 
saline, alcohol/aqueous solutions or other solvents or vehicles such as glycols, glycerol, 
oils such as olive oil or injectable organic esters. 

A pharmaceutically acceptable carrier can contain physiologically 
acceptable compounds that act, for example, to stabilize the expression vector or increase 
the absorption of the expression vector. Such physiologically acceptable compounds 
include, for example, carbohydrates, such as glucose, sucrose or dextrans, antioxidants 
such as ascorbic acid or glutathione, chelating agents, low molecular weight polypeptides, 
antimicrobial agents, inert gases or other stabilizers or excipients. Expression vectors can 
additionally be complexed with other components such as peptides, polypeptides and 
carbohydrates. Expression vectors can also be complexed to particles or beads that can 
be administered to an individual, for example, using a vaccine gun. One skilled in the art 
would know that the choice of a pharmaceutically acceptable carrier, including a 
physiologically acceptable compound, depends, for example, on the route of 
administration of the expression vector. 

The invention further relates to methods of administering a pharmaceutical 
composition comprising an expression vector of the invention to stimulate an immune 
response. The expression vectors are administered by methods well known in the art as 
described in Donnelly et al. (Ann. Rev. Immunol. 15:617-648 (1997)); Felgner et al. (U.S. 
Patent No. 5,580,859, issued December 3, 1996); Felgner (U.S. Patent No. 5,703,055, 
issued December 30, 1997); and Carson et al. (U.S. Patent No. 5,679,647, issued October 
21, 1997), each of which is incorporated herein by reference. In one embodiment, the 
minigene is administered as naked nucleic acid. 

A pharmaceutical composition comprising an expression vector of the 
invention can be administered to stimulate an immune response in a subject by various 
routes including, for example, orally, intravaginally, rectally, or parenterally, such as 
intravenously, intramuscularly, subcutaneously, intraorbitally, intracapsularly, 
intraperitoneally, intracisternally or by passive or facilitated absorption through the skin 
using, for example, a skin patch or transdermal iontophoresis, respectively. Furthermore, 
the composition can be administered by injection, intubation or topically, the latter of 
which can be passive, for example, by direct application of an ointment or powder, or 
active, for example, using a nasal spray or inhalant. An expression vector also can be 
administered as a topical spray, in which case one component of the composition is an 
appropriate propellant. The pharmaceutical composition also can be incorporated, if 
desired, into liposomes, microspheres or other polymer matrices (Felgner et al., U.S. 
Patent No. 5,703,055; Gregoriadis, Liposome Technology, Vols. I to III (2nd ed. 1993), 
each of which is incorporated herein by reference). Liposomes, for example, which 
consist of phospholipids or other lipids, are nontoxic, physiologically acceptable and 
metabolizable carriers that are relatively simple to make and administer. 

The expression vectors of the invention can be delivered to the interstitial 
spaces of tissues of an animal body (Felgner et al., U.S. Patent Nos. 5,580,859 and 
5,703,055). Administration of expression vectors of the invention to muscle is a 
particularly effective method of administration, including intradermal and subcutaneous 
injections and transdermal administration. Transdermal administration, such as by 
iontophoresis, is also an effective method to deliver expression vectors of the invention to 
muscle. Epidermal administration of expression vectors of the invention can also be 
employed. Epidermal administration involves mechanically or chemically irritating the 
outermost layer of epidermis to stimulate an immune response to the irritant (Carson et 
al., U.S. Patent No. 5,679,647). 

Other effective methods of administering an expression vector of the 
invention to stimulate an immune response include mucosal administration (Carson et al., 
U.S. Patent No. 5,679,647). For mucosal administration, the most effective method of 
administration includes intranasal administration of an appropriate aerosol containing the 
expression vector and a pharmaceutical composition. Suppositories and topical 
preparations are also effective for delivery of expression vectors to mucosal tissues of 
genital, vaginal and ocular sites. Additionally, expression vectors can be complexed to 
particles and administered by a vaccine gun. 

The dosage to be administered is dependent on the method of 
administration and will generally be between about 0.1 µg up to about 200 µg. For 
example, the dosage can be from about 0.05 µg/kg to about 50 mg/kg, in particular about 
0.005-5 mg/kg. An effective dose can be determined, for example, by measuring the 
immune response after administration of an expression vector. For example, the 
production of antibodies specific for the MHC class II epitopes or MHC class I epitopes 
encoded by the expression vector can be measured by methods well known in the art, 
including ELISA or other immunological assays. In addition, the activation of T helper 
cells or a CTL response can be measured by methods well known in the art including, for 
example, the uptake of ³H-thymidine to measure T cell activation and the release of ⁵¹Cr 
to measure CTL activity (see Examples II and III below). 

The pharmaceutical compositions comprising an expression vector of the 
invention can be administered to mammals, particularly humans, for prophylactic or 
therapeutic purposes. Examples of diseases that can be treated or prevented using the 
expression vectors of the invention include infection with HBV, HCV, HIV and CMV as 
well as prostate cancer, renal carcinoma, cervical carcinoma, lymphoma, condyloma 
acuminatum and acquired immunodeficiency syndrome (AIDS). 

In therapeutic applications, the expression vectors of the invention are 
administered to an individual already suffering from cancer, autoimmune disease or 
infected with a virus. Those in the incubation phase or acute phase of the disease can be 
treated with expression vectors of the invention, including those expressing all universal 
MHC class II epitopes, separately or in conjunction with other treatments, as appropriate. 

In therapeutic and prophylactic applications, pharmaceutical compositions 
comprising expression vectors of the invention are administered to a patient in an amount 
sufficient to elicit an effective immune response to an antigen and to ameliorate the signs 
or symptoms of a disease. The amount of expression vector to administer that is 
sufficient to ameliorate the signs or symptoms of a disease is termed a therapeutically 
effective dose. The amount of expression vector sufficient to achieve a therapeutically 
effective dose will depend on the pharmaceutical composition comprising an expression 
vector of the invention, the manner of administration, the state and severity of the disease 
being treated, the weight and general state of health of the patient and the judgment of the 
prescribing physician. 

All publications and patent applications cited in this specification are 
herein incorporated by reference as if each individual publication or patent application 
were specifically and individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by 
way of illustration and example for purposes of clarity of understanding, it will be readily 
apparent to one of ordinary skill in the art in light of the teachings of this invention that 
certain changes and modifications may be made thereto without departing from the spirit 
or scope of the appended claims. 
EXAMPLES 

The following example is provided by way of illustration only and not by 
way of limitation. Those of skill in the art will readily recognize a variety of noncritical 
parameters that could be changed or modified to yield essentially similar results. 

EXAMPLE I: Construction of Expression Vectors Containing MHC Class II Epitopes 

This example shows construction of expression vectors containing MHC 
class II epitopes that can be used to target antigens to MHC class II molecules. 

Expression vectors comprising DNA constructs were prepared using 
overlapping oligonucleotides, polymerase chain reaction (PCR) and standard molecular 
biology techniques (Dieffenbach & Dveksler, PCR Primer: A Laboratory Manual (1995); 
Sambrook et al., Molecular Cloning: A Laboratory Manual (2nd ed., 1989), each of 
which is incorporated herein by reference). 

To generate full length wild type Ii, the full length invariant chain was 
amplified, cloned, and sequenced and used in the construction of the three invariant chain 
constructs. Except where noted, the source of cDNA for all the constructs listed below 
was Mouse Spleen Marathon-Ready cDNA made from Balb/c males (Clontech; Palo Alto 
CA). The primers were the oligonucleotide 
GCTAGCGCCGCCACCATGGATGACCAACGCGACCTC (SEQ ID NO:40), which is 
designated murIi-F and contains an NheI site followed by the consensus Kozak sequence 
and the 5' end of the Ii cDNA; and the oligonucleotide 
GGTACCTCACAGGGTGACTTGACCCAG (SEQ ID NO:41), which is designated 
murIi-R and contains a KpnI site and the 3' end of the Ii coding sequence. 
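
The layout of the reverse primer (SEQ ID NO:41) can be checked with a short Python sketch. It confirms the leading KpnI recognition site (GGTACC) and translates the reverse complement of the remaining bases using the standard genetic code. The helper names are illustrative, and reading the remainder of the primer as a continuous reading frame that ends in a stop codon is an assumption made for this example.

```python
# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


KPNI_SITE = "GGTACC"
reverse_primer = "GGTACCTCACAGGGTGACTTGACCCAG"  # SEQ ID NO:41

has_kpni_site = reverse_primer.startswith(KPNI_SITE)

# On the coding strand the primer reads: 3' end of the coding sequence,
# then a stop codon, then the KpnI site.
coding_strand = reverse_complement(reverse_primer)
coding_part = coding_strand[:-len(KPNI_SITE)]
tail_peptide = translate(coding_part)
```

The same check applies to any reverse primer built as restriction site, stop codon, then gene-specific sequence.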

For the PCR reaction, 5 µl of spleen cDNA and 250 nM of each primer 
were combined in a 100 µl reaction with 0.25 mM each dNTP and 2.5 units of Pfu 
polymerase in Pfu polymerase buffer containing 10 mM KCl, 10 mM (NH4)2SO4, 20 mM 
Tris-chloride, pH 8.75, 2 mM MgSO4, 0.1% TRITON X-100 and 100 µg/ml bovine serum 
albumin (BSA). A Perkin/Elmer 9600 PCR machine (Perkin Elmer; Foster City CA) was 
used and the cycling conditions were: 1 cycle of 95°C for 5 minutes, followed by 30 
cycles of 95°C for 15 seconds, 52°C for 30 seconds, and 72°C for 1 minute. The PCR 
reaction was run on a 1% agarose gel, and the 670 base pair product was cut out, purified 
by spinning through a Millipore Ultrafree-MC filter (Millipore; Bedford MA) and cloned 
into pCR-Blunt from Invitrogen (San Diego, CA). Individual clones were screened by 
sequencing, and a correct clone (named bIi#3) was used as a template for the helper 
constructs. 
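
The reaction quantities above can be turned into absolute amounts with simple unit arithmetic. The sketch below is illustrative only: the variable names are assumptions, and the cycling time counts the programmed hold times and ignores temperature ramping.

```python
# Reaction set-up quoted in the text.
volume_ul = 100.0   # reaction volume in microliters
primer_nM = 250.0   # concentration of each primer
dntp_mM = 0.25      # concentration of each dNTP

# nM x µl gives femtomoles; divide by 1000 to get picomoles.
primer_pmol = primer_nM * volume_ul / 1000.0
# mM x µl gives nanomoles directly.
dntp_nmol = dntp_mM * volume_ul

# Cycling program: 95°C for 5 min, then 30 cycles of 15 s, 30 s and 60 s.
initial_denaturation_s = 5 * 60
cycle_s = 15 + 30 + 60
n_cycles = 30
total_hold_s = initial_denaturation_s + n_cycles * cycle_s
total_hold_min = total_hold_s / 60
```

Under these assumptions each primer is present at 25 pmol and each dNTP at 25 nmol per reaction.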

DNA constructs containing pan DR epitope sequences and MHC II 
targeting sequences derived from the Ii protein were prepared. The Ii murine protein has 
been previously described (Zhu & Jones, Nucleic Acids Res. 17:447-448 (1989)), which is 
incorporated herein by reference. Briefly, the IiPADRE construct contains the full length 
Ii sequence with PADRE precisely replacing the CLIP region. The DNA construct 
encodes amino acids 1 through 87 of invariant chain, followed with the 13 amino acid 
PADRE sequence (SEQ ID NO:38) and the rest of the invariant chain DNA sequence 
(amino acids 101-215). The construct was amplified in 2 overlapping halves that were 
joined to produce the final construct. The two primers used to amplify the 5' half were 
murIi-F and the oligonucleotide 
CAGGGTCCAGGCAGCCACGAACTTGGCCACAGGTTTGGCAGA (SEQ ID 
NO:42), which is designated IiPADRE-R. The IiPADRE-R primer includes nucleotides 
303-262 of IiPADRE. The 3' half was amplified with the primer 
GGCTGCCTGGACCCTGAAGGCTGCCGCTATGTCCATGGATAAC (SEQ ID 
NO:43), which is designated IiPADRE-F and includes nucleotides 288-330 of IiPADRE; 
and murIi-R. The PCR conditions were the same as described above, and the two halves 
were isolated by agarose gel electrophoresis as described above. 
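
The coordinates given above can be cross-checked with a little bookkeeping. The Python sketch below tallies the residue segments of the construct and compares each internal primer's length with its stated nucleotide span. The placement of PADRE at residues 88-100 is inferred from "1 through 87", "13 amino acid" and "101-215" and is an assumption rather than a coordinate quoted in the text; the variable names are likewise illustrative.

```python
# Residue segments of the construct, 1-based and inclusive.
# The PADRE span 88-100 is inferred, not quoted from the text.
segments = [
    ("Ii N-terminal part", 1, 87),
    ("PADRE", 88, 100),
    ("Ii C-terminal part", 101, 215),
]
lengths = {name: last - first + 1 for name, first, last in segments}
total_residues = sum(lengths.values())

# Internal primers with the nucleotide spans stated in the text.
primers = {
    "reverse, SEQ ID NO:42": ("CAGGGTCCAGGCAGCCACGAACTTGGCCACAGGTTTGGCAGA", 262, 303),
    "forward, SEQ ID NO:43": ("GGCTGCCTGGACCCTGAAGGCTGCCGCTATGTCCATGGATAAC", 288, 330),
}

# Does each primer's length equal the size of its stated span?
span_matches_length = {
    name: len(seq) == hi - lo + 1 for name, (seq, lo, hi) in primers.items()
}

# Overlap between the two spans, used to join the two halves.
(_, r_lo, r_hi), (_, f_lo, f_hi) = primers.values()
overlap_nt = min(r_hi, f_hi) - max(r_lo, f_lo) + 1
```

With these numbers the three segments sum to 215 residues, and the two primer spans share 16 nucleotides, which is the region through which the two halves anneal in the joining reaction.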

Ten microliters of each PCR product was combined in a 100 µl PCR 
reaction with an annealing temperature of 50°C for five cycles to generate a full length 
template. Primers murIi-F and murIi-R were added and 25 more cycles carried out. The 
full length IiPADRE product was isolated, cloned, and sequenced as described above. 
This construct contains the murine Ii gene with a pan DR epitope sequence substituted for 
the CLIP sequence of Ii (Figure 1). 

A DNA construct, designated I80T, containing the cytoplasmic domain, the 
transmembrane domain and part of the luminal domain of Ii fused to a string of multiple 
MHC class II epitopes was constructed (Figure 2). Briefly, the string of multiple MHC 
class II epitopes was constructed with three overlapping oligonucleotides (oligos). Each 
oligo overlapped its neighbor by 15 nucleotides and the final MHC class II epitope string 
was assembled by extending the overlapping oligonucleotides in three sets of reactions 
using PCR. The three oligonucleotides were: 

oligo 1, nucleotides 241-310, 
CTTCGCATGAAGCTTATCAGCCAGGCTGTGCACGCCGCTCACGCCGAAATCAACGAAGCTGGAAGAACCC 
(SEQ ID NO:44); 

oligo 2, nucleotides 364-295, 
TTCTGGTCAGCAGAAAGAACAGGATAGGAGCGTTTGGAGGGCGATAAGCTGGAGGGGTTCTTCCAGCTTC 
(SEQ ID NO:45); and 

oligo 3, nucleotides 350-42, 
TTCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGTTCGTGGCTGCCTGGACCCTGAAG 
(SEQ ID NO:46). 
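
The way neighboring oligos prime one another can be examined directly from their sequences. The Python sketch below places oligo 2, which is written as the bottom strand, on the top strand by reverse complementing it, and then searches for the longest stretch shared between the end of one oligo and the start of the next. The function names and the minimum overlap of 8 nucleotides are assumptions chosen for this illustration.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def end_overlap(left: str, right: str, min_len: int = 8) -> int:
    """Length of the longest suffix of `left` that is a prefix of `right`.

    Returns 0 when no shared stretch of at least `min_len` bases exists.
    """
    for n in range(min(len(left), len(right)), min_len - 1, -1):
        if left.endswith(right[:n]):
            return n
    return 0


oligo1 = ("CTTCGCATGAAGCTTATCAGCCAGGCTGTGCACGCCGCTCACGCCGAAATCAA"
          "CGAAGCTGGAAGAACCC")   # SEQ ID NO:44, top strand
oligo2 = ("TTCTGGTCAGCAGAAAGAACAGGATAGGAGCGTTTGGAGGGCGATAAGCTGG"
          "AGGGGTTCTTCCAGCTTC")  # SEQ ID NO:45, bottom strand
oligo3 = ("TTCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGTTCGTG"
          "GCTGCCTGGACCCTGAAG")  # SEQ ID NO:46, top strand

# Express oligo 2 on the top strand so all three read in one direction.
oligo2_top = reverse_complement(oligo2)

overlap_1_2 = end_overlap(oligo1, oligo2_top)
overlap_2_3 = end_overlap(oligo2_top, oligo3)
```

A non-zero result for both pairs indicates that the 3' ends can anneal and be extended by the polymerase, which is what the stepwise assembly described next relies on.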

For the first PCR reaction, 5 of oligos 1 and 2 were combined in a 100 
µl reaction containing Pfu polymerase. A Perkin/Elmer 9600 PCR machine was used and 
the annealing temperature used was 45°C. The PCR product was gel-purified, and a 
second reaction containing the PCR product of oligos 1 and 2 with oligo 3 was annealed 
and extended for 10 cycles before gel purification of the full length product to be used as 
a "mega-primer." 

The I80T construct was made by amplifying bIi#3 with murIi-F and the 
mega-primer. The cycling conditions were: 1 cycle of 95°C for 5 minutes, followed by 5 
cycles of 95°C for 15 seconds, 37°C for 30 seconds, and 72°C for 1 minute. Primer Help- 
epR was added and an additional 25 cycles were carried out with the annealing 
temperature raised to 47°C. The Help-epR primer 

GGTACCTCAAGCGGCAGCCTTCAGGGTCCAGGCA (SEQ ID NO:47) corresponds 
to nucleotides 438-405. The full length I80T product was isolated, cloned, and sequenced 
as above. 

The I80T construct (Figure 2) encodes amino acid residues 1 through 80 of 
Ii, containing the cytoplasmic domain, the transmembrane domain and part of the luminal 
domain, fused to a string of multiple MHC class II epitopes corresponding to: amino acid 
residues 323-339 of ovalbumin 
(IleSerGlnAlaValHisAlaAlaHisAlaGluIleAsnGluAlaGlyArg; SEQ ID NO:48); amino 
acid residues 128 to 141 of HBV core antigen (amino acids 
ThrProProAlaTyrArgProProAsnAlaProIleLeu; SEQ ID NO:49); amino acid residues 182 
to 196 of HBV env (amino acids PhePheLeuLeuThrArgIleLeuThrIleProGlnSerLeuAsp; 
SEQ ID NO:50); and the pan DR sequence designated SEQ ID NO:38. 
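
As a consistency check between the oligonucleotides and the epitope string, oligo 1 (SEQ ID NO:44) can be translated and compared with the ovalbumin epitope (SEQ ID NO:48). The Python sketch below uses the standard genetic code and assumes that oligo 1 is in frame from its first base; the third residue of the epitope is read as Gln, which agrees with the CAG codon at that position. Helper names are illustrative.

```python
# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def to_one_letter(three_letter: str) -> str:
    """Convert a run of three-letter residue codes to one-letter codes."""
    return "".join(
        THREE_TO_ONE[three_letter[i:i + 3]]
        for i in range(0, len(three_letter), 3)
    )


oligo1 = ("CTTCGCATGAAGCTTATCAGCCAGGCTGTGCACGCCGCTCACGCCGAAATCAA"
          "CGAAGCTGGAAGAACCC")  # SEQ ID NO:44
ova_epitope = to_one_letter(
    "IleSerGlnAlaValHisAlaAlaHisAlaGluIleAsnGluAlaGlyArg"  # SEQ ID NO:48
)

oligo1_peptide = translate(oligo1)
epitope_encoded = ova_epitope in oligo1_peptide
```

The same comparison could be repeated for the other epitopes once the remaining oligos are placed in the correct reading frame.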

A DNA construct containing the cytoplasmic domain, transmembrane 
domain and a portion of the luminal domain of Ii fused to the MHC class II epitope string 
shown in Figure 2 and amino acid residues 101 to 215 of Ii encoding the trimerization 
region of Ii was generated (Figure 3). This construct, designated IiThfull, encodes the 
first 80 amino acids of invariant chain followed by the MHC class II epitope string 
(replacing CLIP) and the rest of the invariant chain (amino acids 101-215). Briefly, the 
construct was generated as two overlapping halves that were annealed and extended by 
PCR to yield the final product. 

The 5' end of liThfull was made by amplifying I80T with murli-F (SEQ
ID NO:40) and Th-Pad-R. The Th-Pad-R primer AGCGGCAGCCTTCAGGGTC (SEQ
ID NO:51) corresponds to nucleotides 429-411. The 3' half was made by amplifying
bli#3 with liPADRE-F and murli-R (SEQ ID NO:41). The liPADRE-F primer
GGCTGCCTGGACCCTGAAGGCTGCCGCTATGTCCATGGATAAC (SEQ ID NO:52)
corresponds to nucleotides 402-444. Each PCR product was gel purified and mixed, then
denatured, annealed, and extended by five cycles of PCR. Primers murli-F (SEQ ID
NO:40) and murli-R (SEQ ID NO:41) were added and another 25 cycles performed. The
full length product was gel purified, cloned, and sequenced.
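The reverse primers in this example are quoted with descending nucleotide positions, so each one is the reverse complement of the coding strand it anneals to. The sketch below illustrates this for Th-Pad-R (SEQ ID NO:51), assuming the PADRE coding sequence is the one listed elsewhere in this document as SEQ ID NO:90. The function and variable names are hypothetical and are not part of the original procedure.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


# Th-Pad-R primer as given in the text (SEQ ID NO:51).
TH_PAD_R = "AGCGGCAGCCTTCAGGGTC"

# PADRE coding sequence as listed in this document (SEQ ID NO:90).
PADRE_CODING = "GCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCT"

# Coding-strand stretch that the reverse primer anneals to.
primer_target = reverse_complement(TH_PAD_R)

# True if the primer covers the 3' end of the PADRE coding sequence.
anneals_to_padre_end = PADRE_CODING.endswith(primer_target)
```

Under these assumptions, the primer target is the last 19 nucleotides of the PADRE coding sequence, which fits its use as the reverse primer for the epitope string.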

All of the remaining constructs described below were made essentially
according to the scheme shown in Figure 18. Briefly, primer pairs 1F plus 1R, designated
below for each specific construct, were used to amplify the specific signal sequence and
contained an overlapping 15 base pair tail identical to the 5' end of the MHC class II
epitope string. Primer pair Th-ova-F, ATGAGGGAGGGTGTGGAGGG (SEQ ID NO:53),
plus Th-Pad-R (SEQ ID NO:51) were used to amplify the MHC class II epitope string. A
15 base pair overlap and the specific transmembrane and cytoplasmic tail containing the
targeting signals were amplified with primer pairs 2F plus 2R.

All three pieces of each cDNA were amplified using the following
conditions: 1 cycle of 95°C for 5 minutes, followed by 30 cycles of 95°C for 15 seconds,
52°C for 30 seconds, and 72°C for 1 minute. Each of the three fragments was agarose-gel
purified, and the signal sequence and MHC class II string fragments were combined and
joined by five cycles in a second PCR. After five cycles, primers 1F and Th-Pad-R were
added for 25 additional cycles and the PCR product was gel purified. This signal
sequence plus MHC class II epitope string fragment was combined with the
transmembrane plus cytoplasmic tail fragment for the final PCR. After five cycles,
primers 1F plus 2R were added for 25 additional cycles and the product was gel purified,
cloned and sequenced.
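The three-piece assembly described above relies on each fragment sharing a 15 base pair tail with its neighbor, so that annealed fragments prime each other during extension. The following is a minimal in-silico sketch of that joining logic. The function name and the toy sequences are hypothetical placeholders, not sequences from the constructs.

```python
def join_by_overlap(left: str, right: str, overlap: int = 15) -> str:
    """Join two fragments whose adjoining ends share an identical tail.

    Models the product of overlap-extension PCR: the shared tail appears
    once in the joined product.
    """
    if len(left) < overlap or len(right) < overlap:
        raise ValueError("fragments are shorter than the overlap")
    if left[-overlap:] != right[:overlap]:
        raise ValueError("fragment ends do not share the expected overlap")
    return left + right[overlap:]


# Toy fragments (hypothetical): signal piece, epitope string, targeting tail.
TAIL_A = "CCCGGGTTTAAACCC"   # 15-base tail shared by pieces 1 and 2
TAIL_B = "GGGAAACCCTTTGGG"   # 15-base tail shared by pieces 2 and 3
signal = "AAAA" + TAIL_A
epitopes = TAIL_A + "ACGTACGT" + TAIL_B
targeting = TAIL_B + "TTTT"

# Second PCR joins signal + epitope string; final PCR adds the targeting tail.
intermediate = join_by_overlap(signal, epitopes)
full_length = join_by_overlap(intermediate, targeting)
```

The two calls mirror the order in the text: signal sequence plus epitope string first, then the transmembrane and cytoplasmic tail fragment.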

A DNA construct containing the murine immunoglobulin kappa signal
sequence fused to the T helper epitope string shown in Figure 2 and the transmembrane
and cytoplasmic domains of LAMP-1 was generated (Figure 4) (Granger et al., J. Biol.
Chem. 265:12036-12043 (1990)), which is incorporated by reference (mouse LAMP-1
GenBank accession No. M32015). This construct, designated kappaLAMP-Th, contains
the consensus mouse immunoglobulin kappa signal sequence and was amplified from a
plasmid containing full length immunoglobulin kappa as depicted in Figure 18. The
primer 1F used was the oligonucleotide designated KappaSig-F,
GCTAGCGCCGCCACCATGGGAATGCAG (SEQ ID NO:54).
The primer 1R used was the oligonucleotide designated Kappa-Th-R,
CACAGCCTGGCTGATTCCTCTGGACCC (SEQ ID NO:55).
The primer 2F used was the oligonucleotide designated PAD/LAMP-F,
CTGAAGGCTGCCGCTAACAACATGTTGATCCGC (SEQ ID NO:56). The primer 2R
used was the oligonucleotide designated LAMP-CYTOR,
GGTACCCTAGATGGTCTGATAGCC (SEQ ID NO:57).
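The outer primers carry the cloning sites used later to move each insert into the expression vector. As a rough check, the sketch below scans the KappaSig-F and LAMP-CYTOR primers for the NheI (GCTAGC) and KpnI (GGTACC) recognition sequences and for the Kozak sequence GCCGCCACCATG given elsewhere in this document. The helper names are hypothetical.

```python
RECOGNITION_SITES = {"NheI": "GCTAGC", "KpnI": "GGTACC"}
KOZAK = "GCCGCCACCATG"  # consensus Kozak sequence as given in this document


def find_all(seq: str, motif: str) -> list:
    """Return every 0-based start position of motif in seq."""
    seq = seq.upper()
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions


def scan_primer(seq: str) -> dict:
    """Map each enzyme name to the positions of its site in the primer."""
    return {name: find_all(seq, site) for name, site in RECOGNITION_SITES.items()}


KAPPASIG_F = "GCTAGCGCCGCCACCATGGGAATGCAG"   # SEQ ID NO:54
LAMP_CYTOR = "GGTACCCTAGATGGTCTGATAGCC"      # SEQ ID NO:57

forward_sites = scan_primer(KAPPASIG_F)
reverse_sites = scan_primer(LAMP_CYTOR)
kozak_positions = find_all(KAPPASIG_F, KOZAK)
```

In this reading, the forward primer begins with an NheI site followed directly by the Kozak sequence, and the reverse primer begins with a KpnI site.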

A DNA construct containing the signal sequence of H2-M fused to the
MHC class II epitope string shown in Figure 2 and the transmembrane and cytoplasmic
domains of H2-M was generated (Figure 5). The mouse H2-M gene has been described
previously (Peleraux et al., Immunogenetics 43:204-214 (1996)), which is incorporated
herein by reference. This construct was designated H2M-Th and was constructed as
depicted in Figure 18. The primer 1F used was the oligonucleotide designated H2-Mb-1F,
GCC GCT AGC GCC GCC ACC ATG GCT GCA CTC TGG (SEQ ID NO:58). The
primer 1R used was the oligonucleotide designated H2-Mb-1R, CAC AGC CTG GCT
GAT CCC CAT ACA GTG CAG (SEQ ID NO:59). The primer 2F used was the
oligonucleotide designated H2-Mb-2F, CTG AAG GCT GCC GCT AAG GTC TCT GTG
TCT (SEQ ID NO:60). The primer 2R used was the oligonucleotide designated H2-Mb-2R,
GCG GGT ACC CTA ATG CCG TCC TTC (SEQ ID NO:61).

A DNA construct containing the signal sequence of H2-DO fused to the
MHC class II epitope string shown in Figure 2 and the transmembrane and cytoplasmic
domains of H2-DO was generated (Figure 6). The mouse H2-DO gene has been
described previously (Larhammar et al., J. Biol. Chem. 260:14111-14119 (1985)), which
is incorporated herein by reference (GenBank accession No. M19423). This construct,
designated H2O-Th, was constructed as depicted in Figure 18. The primer 1F used was
the oligonucleotide designated H2-Ob-1F, GCG GCT AGC GCC GCC ACC ATG GGC
GCT GGG AGG (SEQ ID NO:62). The primer 1R used was the oligonucleotide
designated H2-Ob-1R, TGC ACA GCC TGG CTG ATG GAA TCC AGC CTC (SEQ ID
NO:63). The primer 2F used was the oligonucleotide designated H2-Ob-2F, CTG AAG
GCT GCC GCT ATA CTG AGT GGA GCT (SEQ ID NO:64). The primer 2R used was
the oligonucleotide designated H2-Ob-2R, GCC GGT ACC TCA TGT GAC ATG TCC
CG (SEQ ID NO:65).

A DNA construct containing a pan DR epitope sequence (SEQ ID NO:38)
fused to the amino-terminus of influenza matrix protein is generated (Figure 7). This
construct, designated PADRE-Influenza matrix, contains the universal MHC class II
epitope PADRE attached to the amino terminus of the influenza matrix coding sequence.
The construct is made using a long primer at the 5' end. The 5' primer is the
oligonucleotide
GCTAGCGCCGCCACCATGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGC
CGCTATGAGTCTTCTAACCGAGGTCGA (SEQ ID NO:66). The 3' primer is the
oligonucleotide TCACTTGAATCGCTGCATCTGCACCCCCAT (SEQ ID NO:67).
Influenza virus from the American Type Culture Collection (ATCC) is used as a source for
the matrix coding region (Perdue et al., Science 279:393-396 (1998)), which is
incorporated herein by reference (GenBank accession No. AF036358).

A DNA construct containing a pan DR epitope sequence (SEQ ID NO:38)
fused to the amino-terminus of HBV-S antigen was generated (Figure 8). This construct
is designated PADRE-HBV-s and was generated by annealing two overlapping
oligonucleotides to add PADRE onto the amino terminus of hepatitis B surface antigen
(Michel et al., Proc. Natl. Acad. Sci. USA 81:7708-7712 (1984); Michel et al., Proc. Natl.
Acad. Sci. USA 92:5307-5311 (1995)), each of which is incorporated herein by reference.
One oligonucleotide was
GCTAGCGCCGCCACCATGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGC
CGCTC (SEQ ID NO:68). The second oligonucleotide was
CTCGAGAGCGGCAGCCTTCAGGGTCCAGGCAGCCACGAACTTGGCCATGGTG
GCGGCG (SEQ ID NO:69). When annealed, the oligos have NheI and XhoI cohesive
ends. The oligos were heated to 100°C and slowly cooled to room temperature to anneal.
A three part ligation joined PADRE with an XhoI-KpnI fragment containing HBV-s
antigen into the NheI plus KpnI sites of the expression vector.

A DNA construct containing the signal sequence of Ig-α fused to the MHC
class II epitope string shown in Figure 2 and the transmembrane and cytoplasmic domains
of Ig-α was generated (Figure 9). The mouse Ig-α gene has been described previously
(Kashiwamura et al., J. Immunol. 145:337-343 (1990)), which is incorporated herein by
reference (GenBank accession No. M31773). This construct, designated Ig-alphaTh, was
constructed as depicted in Figure 18. The primer 1F used was the oligonucleotide
designated Igalpha-1F, GCG GCT AGC GCC GCC ACC ATG CCA GGG GGT CTA
(SEQ ID NO:70). The primer 1R used was the oligonucleotide designated Igalpha-1R,
GCA CAG CCT GGC TGA TGG CCT GGC ATC CGG (SEQ ID NO:71). The primer 2F
used was the oligonucleotide designated Igalpha-2F, CTG AAG GCT GCC GCT GGG
ATC ATC TTG CTG (SEQ ID NO:72). The primer 2R used was the oligonucleotide
designated Igalpha-2R, GCG GGT ACC TCA TGG CTT TTC CAG CTG (SEQ ID
NO:73).

A DNA construct containing the signal sequence of Ig-β fused to the MHC
class II string shown in Figure 2 and the transmembrane and cytoplasmic domains of Ig-β
was generated (Figure 10). The Ig-β sequence is the B29 gene of mouse and has been
described previously (Hermanson et al., Proc. Natl. Acad. Sci. USA 85:6890-6894
(1988)), which is incorporated herein by reference (GenBank accession No. J03857).
This construct, designated Ig-betaTh, was constructed as depicted in Figure 18. The
primer 1F used was the oligonucleotide designated B29-1F (33mer) GCG GCT AGC
GCC GCC ACC ATG GCC ACA CTG GTG (SEQ ID NO:74). The primer 1R used was
the oligonucleotide designated B29-1R (30mer) CAC AGC CTG GCT GAT CGG CTC
ACC TGA GAA (SEQ ID NO:75). The primer 2F used was the oligonucleotide
designated B29-2F (30mer) CTG AAG GCT GCC GCT ATT ATC TTG ATC CAG (SEQ
ID NO:76). The primer 2R used was the oligonucleotide designated B29-2R (27mer),
GCC GGT ACC TCA TTC CTG GCC TGG ATG (SEQ ID NO:77).

A DNA construct containing the kappa immunoglobulin signal
sequence fused to the MHC class II epitope string shown in
Figure 2 was constructed (Figure 11). This construct is designated SigTh and was
generated by using the kappaLAMP-Th construct (shown in Figure 4) and amplifying
with the primer pair KappaSig-F (SEQ ID NO:54) plus Help-epR (SEQ ID NO:47) to
create SigTh. SigTh contains the kappa immunoglobulin signal sequence fused to the T
helper epitope string and terminated with a translational stop codon.

Constructs encoding human sequences corresponding to the above
described constructs having mouse sequences are prepared by substituting human
sequences for the mouse sequences. Briefly, for the liPADRE construct, corresponding to
Figure 1, amino acid residues 1-80 from the human Ii gene HLA-DR sequence (Figure
12) (GenBank accession No. X00497 M14765) are substituted for the mouse Ii sequences
and fused to PADRE, followed by human invariant chain HLA-DR amino acid
residues 114-223. For the I80T construct, corresponding to Figure 2, amino acid residues
1-80 from the human sequence of Ii are followed by a MHC class II epitope string. For the
liThfull construct, corresponding to Figure 3, amino acid residues 1-80 from the human
sequence of Ii, fused to a MHC class II epitope string, are followed by human
invariant chain amino acid residues 114-223.

For the LAMP-Th construct, similar to Figure 4, the signal sequence
encoded by amino acid residues 1-19 (nucleotides 11-67) of human LAMP-1 (Figure 13)
(GenBank accession No. J04182), which is fused to the MHC class II epitope string, is
followed by the transmembrane (nucleotides 1163-1213) and cytoplasmic tail
(nucleotides 1214-1258) region encoded by amino acid residues 380-416 of human
LAMP-1.

For the HLA-DM-Th construct, corresponding to Figure 5, the signal
sequence encoded by amino acid residues 1-17 (nucleotides 1-51) of human HLA-DMB
(Figure 14) (GenBank accession No. U15085), which is fused to the MHC class II epitope
string, is followed by the transmembrane (nucleotides 646-720) and cytoplasmic tail
(nucleotides 721-792) region encoded by amino acid residues 216-263 of human
HLA-DMB.

For the HLA-DO-Th construct, corresponding to Figure 6, the signal
sequence encoded by amino acid residues 1-21 (nucleotides 1-63) of human HLA-DO
(Figure 15) (GenBank accession No. L29472 J02736 N00052), which is fused to the
MHC class II epitope string, is followed by the transmembrane (nucleotides 685-735) and
cytoplasmic tail (nucleotides 736-819) region encoded by amino acid residues 223-273 of
human HLA-DO.

For the Ig-alphaTh construct, corresponding to Figure 9, the signal
sequence encoded by amino acid residues 1-29 (nucleotides 1-87) of human Ig-α MB-1
(Figure 16) (GenBank accession No. U05259), which is fused to the MHC class II epitope
string, is followed by the transmembrane (nucleotides 424-498) and cytoplasmic tail
(nucleotides 499-678) region encoded by amino acid residues 142-226 of human Ig-α
MB-1.

For the Ig-betaTh construct, corresponding to Figure 10, the signal
sequence encoded by amino acid residues 1-28 (nucleotides 17-100) of human Ig-β B29
(Figure 17) (GenBank accession No. M80461), which is fused to the MHC class II
epitope string, is followed by the transmembrane (nucleotides 500-547) and cytoplasmic
tail (nucleotides 548-703) region encoded by amino acid residues 156-229 of human Ig-β.

The SigTh construct shown in Figure 11 can be used in mouse and human.
Alternatively, a signal sequence derived from an appropriate human gene containing a
signal sequence can be substituted for the mouse kappa immunoglobulin sequence in the
SigTh construct.

The PADRE-Influenza matrix construct shown in Figure 7 and the 
PADRE-HBVs construct shown in Figure 8 can be used in mouse and human. 

Some of the DNA constructs described above were cloned into the vector
pEP2 (Figure 19; SEQ ID NO:35). The pEP2 vector was constructed to contain dual
CMV promoters. The pEP2 vector used the backbone of pcDNA3.1(-)Myc-His A from
Invitrogen and pIRES1hyg from Clontech. Changes were made to both vectors before the
CMV transcription unit from pIRES1hyg was moved into the modified pcDNA vector.

The pcDNA3.1(-)Myc-His A vector (http://www.invitrogen.com) was
modified. Briefly, the PvuII fragment (nucleotides 1342-3508) was deleted. A BspHI
fragment that contains the Ampicillin resistance gene (nucleotides 4404-5412) was cut
out. The Ampicillin resistance gene was replaced with the kanamycin resistance gene
from pUC4K (GenBank Accession #X06404). pUC4K was amplified with the primer set:
TCTGATGTTACATTGCACAAG (SEQ ID NO:78) (nucleotides 1621-1601) and
GCGCACTCATGATGCTCTGCCAGTGTTACAACC (SEQ ID NO:79) (nucleotides
682-702 plus the addition of a BspHI restriction site on the 5' end). The PCR product
was digested with BspHI and ligated into the vector digested with BspHI. The region
between the PmeI site at nucleotide 905 and the EcoRV site at nucleotide 947 was
deleted. The vector was then digested with PmeI (cuts at nucleotide 1076) and ApaI (cuts
at nucleotide 1004), Klenow filled in at the cohesive ends and ligated. The KpnI site at
nucleotide 994 was deleted by digesting with KpnI and filling in the ends with Klenow
DNA polymerase, and ligating. The intron A sequence from CMV (GenBank accession
M21295, nucleotides 635-1461) was added by amplifying CMV DNA with the primer set:
GCGTCTAGAGTAAGTACCGCCTATAGACTC (SEQ ID NO:80) (nucleotides 635-655
plus an XbaI site on the 5' end) and CCGGCTAGCCTGCAGAAAAGACCCATGGAA
(SEQ ID NO:81) (nucleotides 1461-1441 plus an NheI site on the 3' end). The PCR
product was digested with XbaI and NheI and ligated into the NheI site of the vector
(nucleotide 895 of the original pcDNA vector) so that the NheI site was on the 3' end of
the intron.

To modify the pIRES1hyg vector (GenBank Accession U89672,
Clontech), the KpnI site (nucleotide 911) was deleted by cutting and filling in with
Klenow. The plasmid was cut with NotI (nucleotide 1254) and XbaI (nucleotide 3196)
and a polylinker oligo was inserted into the site. The polylinker was formed by annealing
the following two oligos:
GGCCGCAAGGAAAAAATCTAGAGTCGGCCATAGACTAATGCCGGTACCG (SEQ
ID NO:82) and
CTAGCGGTACCGGCATTAGTCTATGGCCCGACTCTAGATTTTTTCCTTGC (SEQ
ID NO:83). The resulting plasmid was cut with HincII and the fragment between HincII
sites 234 and 3538 was isolated and ligated into the modified pcDNA vector. This
fragment contains a CMV promoter, intron, polylinker, and polyadenylation signal.

The pIRES1hyg piece and the pcDNA piece were combined to form pEP2.
The modified pcDNA3.1(-)Myc-His A vector was partially digested with PvuII to isolate
a linear fragment with the cut downstream of the pcDNA polyadenylation signal (the
other PvuII site is in the CMV intron). The HincII fragment from the modified pIRES1hyg
vector was ligated into the PvuII cut vector. The polyadenylation signal from the pcDNA
derived transcription unit was deleted by digesting with EcoRI (pcDNA nucleotide 955)
and XhoI (pIRES1hyg nucleotide 3472) and replaced with a synthetic polyadenylation
sequence. The synthetic polyadenylation signal was described in Levitt et al., Genes and
Development 3:1019-1025 (1989).

Two oligos were annealed to produce a fragment that contained a
polylinker and polyadenylation signal with EcoRI and XhoI cohesive ends. The oligos
were:
AATTCGGATATCCAAGCTTGATGAATAAAAGATCAGAGCTCTAGTGATCTGTGT
GTTGGTTTTTTTGTGTGC (SEQ ID NO:84) and
TCGAGCACACAAAAAACCAACACACAGATCACTAGAGCTCTGATCTTTTTATT
CATCAAGCTTGGATATCCG (SEQ ID NO:85).

The resulting vector is named pEP2 and contains two separate 
transcription units. Both transcription units use the same CMV promoter but each 
contains different intron, polylinker, and polyadenylation sequences. 

The pEP2 vector contains two transcription units. The first transcription
unit contains the CMV promoter initially from pcDNA (nucleotides 210-862 in Figure
19), CMV intron A sequence (nucleotides 900-1728 in Figure 19), polylinker cloning site
(nucleotides 1740-1760 in Figure 19) and synthetic polyadenylation signal (nucleotides
1764-1769 in Figure 19). The second transcription unit, which was initially derived from
pIRES1hyg, contains the CMV promoter (nucleotides 3165-2493 in Figure 19), intron
sequence (nucleotides 2464-2173 in Figure 19), polylinker cloning site (nucleotides
2126-2095 in Figure 19) and bovine growth hormone polyadenylation signal (nucleotides
1979-1974 in Figure 19). The kanamycin resistance gene is encoded in nucleotides 4965-4061
(Figure 19).

The DNA constructs described above were digested with NheI and KpnI
and cloned into the XbaI and KpnI sites of pEP2 (the second transcription unit).

Additional vectors were also constructed. To test for the effect of
co-expression of MHC class I epitopes with MHC class II epitopes, an insert was generated,
designated AOS, that contains nine MHC class I epitopes. The AOS insert was initially
constructed in the vector pMIN.0 (Figure 20; SEQ ID NO:36). Briefly, the AOS insert
contains nine MHC class I epitopes, six restricted by HLA-A2 and three restricted by
HLA-A11, and the universal MHC class II epitope PADRE. The vector pMIN.0 contains
epitopes from HBV, HIV and a mouse ovalbumin epitope. The MHC class I epitopes
appear in pMIN.0 in the following order:

consensus mouse Ig Kappa signal sequence (pMIN.0 amino acid residues
1-20, nucleotides 16-81) MQVQIQSLFLLLLWVPGSRG (SEQ ID NO:86) encoded by
nucleotides ATG CAG GTG CAG ATC CAG AGC CTG TTT CTG CTC CTC CTG TGG
GTG CCC GGG TCC AGA GGA (SEQ ID NO:87);

HBV pol 149-159 (A11 restricted) (pMIN.0 amino acid residues 21-31,
nucleotides 82-114) HTLWKAGILYK (SEQ ID NO:88) encoded by nucleotides CAC ACC
CTG TGG AAG GCC GGA ATC CTG TAT AAG (SEQ ID NO:89);

PADRE-universal MHC class II epitope (pMIN.0 amino acid residues 32-45,
nucleotides 115-153) AKFVAAWTLKAAA (SEQ ID NO:38) encoded by nucleotides
GCC AAG TTC GTG GCT GCC TGG ACC CTG AAG GCT GCC GCT (SEQ ID
NO:90);

HBV core 18-27 (A2 restricted) (pMIN.0 amino acid residues 46-55,
nucleotides 154-183) FLPSDFFPSV (SEQ ID NO:91) encoded by nucleotides TTC CTG
CCT AGC GAT TTC TTT CCT AGC GTG (SEQ ID NO:92);

HIV env 120-128 (A2 restricted) (pMIN.0 amino acid residues 56-64,
nucleotides 184-210) KLTPLCVTL (SEQ ID NO:93) encoded by nucleotides AAG CTG
ACC CCA CTG TGC GTG ACC CTG (SEQ ID NO:94);

HBV pol 551-559 (A2 restricted) (pMIN.0 amino acid residues 65-73,
nucleotides 211-237) YMDDVVLGA (SEQ ID NO:95) encoded by nucleotides TAT ATG
GAT GAC GTG GTG CTG GGA GCC (SEQ ID NO:96);

mouse ovalbumin 257-264 (Kb restricted) (pMIN.0 amino acid residues
74-81, nucleotides 238-261) SIINFEKL (SEQ ID NO:97) encoded by nucleotides AGC
ATC ATC AAC TTC GAG AAG CTG (SEQ ID NO:98);

HBV pol 455-463 (A2 restricted) (pMIN.0 amino acid residues 82-90,
nucleotides 262-288) GLSRYVARL (SEQ ID NO:99) encoded by nucleotides GGA CTG
TCC AGA TAC GTG GCT AGG CTG (SEQ ID NO:100);

HIV pol 476-84 (A2 restricted) (pMIN.0 amino acid residues 91-99,
nucleotides 289-315) ILKEPVHGV (SEQ ID NO:101) encoded by nucleotides ATC CTG
AAG GAG CCT GTG CAC GGC GTG (SEQ ID NO:102);

HBV core 141-151 (A11 restricted) (pMIN.0 amino acid residues 100-110,
nucleotides 316-348) STLPETTVVRR (SEQ ID NO:103) encoded by nucleotides TCC ACC
CTG CCA GAG ACC ACC GTG GTG AGG AGA (SEQ ID NO:104);

HIV env 49-58 (A11 restricted) (pMIN.0 amino acid residues 111-120,
nucleotides 349-378) TVYYGVPVWK (SEQ ID NO:105) encoded by nucleotides ACC
GTG TAC TAT GGA GTG CCT GTG TGG AAG (SEQ ID NO:106); and

HBV env 335-343 (A2 restricted) (pMIN.0 amino acid residues 121-129,
nucleotides 378-405) WLSLLVPFV (SEQ ID NO:107) encoded by nucleotides TGG
CTG AGC CTG CTG GTG CCC TTT GTG (SEQ ID NO:108).
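Each entry in the list above pairs a peptide with its coding nucleotides, so the pairs can be cross-checked by translating the codons with the standard genetic code. The sketch below is one way to do that. The function name and the compact codon-table construction are illustrative choices, not part of the original disclosure.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence (spaces allowed) into one-letter amino acids."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Coding sequences copied from the epitope list above.
hbv_pol_149 = translate("CAC ACC CTG TGG AAG GCC GGA ATC CTG TAT AAG")
ova_257 = translate("AGC ATC ATC AAC TTC GAG AAG CTG")
hbv_core_141 = translate("TCC ACC CTG CCA GAG ACC ACC GTG GTG AGG AGA")
```

A check of this kind is also a convenient way to catch transcription errors in either the peptide or the nucleotide column.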

The pMIN.0 vector contains a KpnI restriction site (pMIN.0 nucleotides
406-411) and a NheI restriction site (pMIN.0 nucleotides 1-6). The pMIN.0 vector
contains a consensus Kozak sequence (nucleotides 7-18) (GCCGCCACCATG; SEQ ID
NO:109) and murine Kappa Ig-light chain signal sequence followed by a string of 10
MHC class I epitopes and one universal MHC class II epitope. The pMIN.0 sequence
encodes an open reading frame fused to the Myc and His antibody epitope tag coded for
by the pcDNA 3.1 Myc-His vector. The pMIN.0 vector was constructed with eight
oligonucleotides:

Min1 oligo
GAGGAGCAGAAACAGGCTCTGGATCTGCACCTGCATTCCCATGGTGGCGGCGC
TAGCAAGCTTCTTGCGC (SEQ ID NO:110);

Min2 oligo
CCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGGA
AGGCCGGAATCCTGTATA (SEQ ID NO:111);

Min3 oligo
TCGCTAGGCAGGAAAGCGGCAGCCTTCAGGGTCCAGGCAGCCACGAACTTGG
CCTTATACAGGATTCCGG (SEQ ID NO:112);

Min4 oligo
CTTTCCTGCCTAGCGATTTCTTTCCTAGCGTGAAGCTGACCCCACTGTGCGTGA
CCCTGTATATGGATGAC (SEQ ID NO:113);

Min5 oligo
CGTACCTGGACAGTCCCAGCTTCTCGAAGTTGATGATGCTGGCT
CCCAGCACCACGTCATCCATATACAG (SEQ ID NO:114);

Min6 oligo
GGACTGTCCAGATACGTGGCTAGGCTGATCCTGAAGGAGCCTGTGCACGGCGT
GTCCACCCTGCCAGAGAC (SEQ ID NO:115);

Min7 oligo
GCTCAGCCACTTCCACACAGGCACTCCATAGTACACGGTCCTCCTCACCACGG
TGGTCTCTGGCAGGGTG (SEQ ID NO:116);

Min8 oligo
GTGGAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGGGTACCTGATCTAGAGC
(SEQ ID NO:117).

Additional primers were flanking primer 5', GCG CAA GAA GCT TGC
TAG CG (SEQ ID NO:118) and flanking primer 3', GCT CTA GAT GAG GTA CCC
CAC (SEQ ID NO:119).

The original pMIN.0 minigene construction was carried out using eight
overlapping oligos averaging approximately 70 nucleotides in length, which were
synthesized and HPLC purified by Operon Technologies Inc. Each oligo overlapped its
neighbor by 15 nucleotides, and the final multi-epitope minigene was assembled by
extending the overlapping oligos in three sets of reactions using PCR (Ho et al., Gene
77:51-59 (1989)).

For the first PCR reaction, 5 ng of each of two oligos were annealed and
extended: 1+2, 3+4, 5+6, and 7+8 were combined in 100 µl reactions containing 0.25 mM
each dNTP and 2.5 units of Pfu polymerase in Pfu polymerase buffer containing 10 mM
KCl, 10 mM (NH4)2SO4, 20 mM Tris-chloride, pH 8.75, 2 mM MgSO4, 0.1% TRITON
X-100 and 100 µg/ml BSA. A Perkin/Elmer 9600 PCR machine was used and the
annealing temperature used was 5°C below the lowest calculated Tm of each primer pair.
The full length dimer products were gel-purified, and two reactions containing the
product of 1-2 and 3-4, and the product of 5-6 and 7-8 were mixed, annealed and
extended for 10 cycles. Half of the two reactions were then mixed, and 5 cycles of
annealing and extension carried out before flanking primers were added to amplify the
full length product for 25 additional cycles. The full length product was gel purified and
cloned into pCR-blunt (Invitrogen) and individual clones were screened by sequencing.
The Min insert was isolated as an NheI-KpnI fragment and cloned into the same sites of
pcDNA3.1(-)/Myc-His A (Invitrogen) for expression. The Min protein contains the Myc
and His antibody epitope tags at its carboxyl-terminal end.

For all the PCR reactions described, a total of 30 cycles were performed
using Pfu polymerase and the following conditions: 95°C for 15 seconds, annealing
temperature for 30 seconds, 72°C for one minute. The annealing temperature used was
5°C below the lowest calculated Tm of each primer pair.
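The text does not state how the Tm values were calculated. As an illustration only, the sketch below uses the simple Wallace rule (2°C per A or T, 4°C per G or C), which is an assumption and is usually applied only to short oligos, and then applies the stated rule of annealing 5°C below the lowest Tm in the pair. All function names are hypothetical.

```python
def wallace_tm(primer: str) -> int:
    """Estimate Tm in degrees C with the Wallace rule: 2*(A+T) + 4*(G+C).

    Assumption: the original work may have used a different Tm formula.
    """
    seq = primer.replace(" ", "").upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def annealing_temperature(primers, offset: int = 5) -> int:
    """Return the annealing temperature: lowest Tm in the set minus offset."""
    return min(wallace_tm(p) for p in primers) - offset


# Flanking primers as given in the text (SEQ ID NO:118 and SEQ ID NO:119).
FLANK_5 = "GCG CAA GAA GCT TGC TAG CG"
FLANK_3 = "GCT CTA GAT GAG GTA CCC CAC"
flank_anneal = annealing_temperature([FLANK_5, FLANK_3])
```

For primers of this length a nearest-neighbor calculation would normally be preferred; the Wallace rule is used here only to keep the sketch self-contained.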

Three changes to pMIN.0 were made to produce pMIN.1 (Figure 21; SEQ
ID NO:37, also referred to as pMIN-AOS). The mouse ova epitope was removed, the
position 9 alanine anchor residue (#547) of HBV pol 551-560 was converted to a valine
which increased the in vitro binding affinity 40-fold, and a translational stop codon was
introduced at the end of the multi-epitope coding sequence. The changes were made by
amplifying two overlapping fragments and combining them to yield the full length
product.

The first reaction used the 5' pcDNA vector primer T7 and the primer Min-ovaR
(nucleotides 247-218) TGGACAGTGCCACTGCGAGGAGGAGGTGAT (SEQ ID
NO:120). The 3' half was amplified with the primers: Min-ovaF (nucleotides 228-257)
GCTGGGAGTGGGACTGTCCAGGTACGTGGC (SEQ ID NO:121) and Min-StopR
(nucleotides 390-361) GGTACGTCACACAAAGGGCACCAGCAGGC (SEQ ID
NO:122).

The two fragments were gel purified, mixed, denatured, annealed, and
filled in with five cycles of PCR. The full length fragment was amplified with the
flanking primers T7 and Min-StopR for 25 more cycles. The product was gel purified,
digested with NheI and KpnI and cloned into pcDNA3.1 for sequencing and expression.
The insert from pMIN.1 was isolated as an NheI-KpnI fragment and cloned into pEP2 to
make pEP2-AOS.

EXAMPLE II: Assay for T Helper Cell Activation

This example shows methods for assaying T helper cell activity. One
method for assaying T helper cell activity uses spleen cells of an immunized organism.
Briefly, a spleen cell pellet is suspended with 2-3 ml of red blood cell lysis buffer
containing 8.3 g/liter ammonium chloride in 0.001 M Tris-HCl, pH 7.5. The cells are
incubated in lysis buffer for 3-5 min at room temperature with occasional vortexing. An
excess volume of 50 ml of R10 medium is added to the cells, and the cells are pelleted.
The cells are resuspended and pelleted one or two more times in R2 medium or R10
medium.

The cell pellet is suspended in R10 medium and counted. If the cell
suspension is aggregated, the aggregates are removed by filtration or by allowing the
aggregates to settle by gravity. The cell concentration is brought to 10⁷/ml, and 100 µl of
spleen cells are added to 96 well flat bottom plates.

Dilutions of the appropriate peptide, such as pan DR epitope (SEQ ID
NO:145), are prepared in R10 medium at 100, 10, 1, 0.1 and 0.01 ng/ml, and 100 µl of
peptide are added to duplicate or triplicate wells of spleen cells. The final peptide
concentration is 50, 5, 0.5, 0.05 and 0.005 ng/ml. Control wells receive 100 µl R10
medium.
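The final concentrations quoted above follow from adding equal volumes of peptide solution and cell suspension, which halves each stock concentration. A minimal sketch of the arithmetic, with hypothetical function and variable names and with the 100-unit cell volume taken from the plating step described earlier:

```python
def final_concentration(stock: float, added_volume: float, well_volume: float) -> float:
    """Concentration after adding `added_volume` of stock to `well_volume` of cells.

    Volumes only need to share a unit; the result has the unit of `stock`.
    """
    return stock * (added_volume / (added_volume + well_volume))


# Stock dilutions from the text; 100 volume units of peptide are added to
# 100 volume units of spleen cells in each well.
stocks = [100, 10, 1, 0.1, 0.01]
finals = [final_concentration(s, 100, 100) for s in stocks]
```

With equal volumes the dilution factor is one half, which reproduces the series 50, 5, 0.5, 0.05 and 0.005 given in the text.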

The plates are incubated for 3 days at 37°C. After 3 days, 20 µl of
50 µCi/ml ³H-thymidine is added per well. Cells are incubated for 18-24 hours and then
harvested onto glass fiber filters. The incorporation of ³H-thymidine into DNA of
proliferating cells is measured in a beta counter.

A second assay for T helper cell activity uses peripheral blood
mononuclear cells (PBMC) that are stimulated in vitro as described in Alexander et al.,
supra and Sette (WO 95/07,707), as adapted from Manca et al., J. Immunol. 146:1964-1971
(1991), which is incorporated herein by reference. Briefly, PBMC are collected
from healthy donors and purified over Ficoll-Paque (Pharmacia Biotech; Piscataway,
NJ). PBMC are plated in a 24 well tissue culture plate at 4 x 10^ cells/ml. Peptides are
added at a final concentration of 10 µg/ml. Cultures are incubated at 37°C in 5% CO2.

On day 4, recombinant interleukin-2 (IL-2) is added at a final
concentration of 10 ng/ml. Cultures are fed every 3 days by aspirating 1 ml of medium
and replacing with fresh medium containing IL-2. Two additional stimulations of the T
cells with antigen are performed on approximately days 14 and 28. The T cells (3 x
10^/well) are stimulated with peptide (10 ng/ml) using autologous PBMC cells (2x10^
irradiated cells/well) (irradiated with 7500 rads) as antigen-presenting cells in a total of
three wells of a 24 well tissue culture plate. In addition, on day 14 and 28, T cell
proliferative responses are determined under the following conditions: 2 x 10* T
cells/well; 1x10^ irradiated PBMC/well as antigen-presenting cells; peptide
concentration varying between 0.01 and 10 ng/ml final concentration. The proliferation
of the T cells is measured 3 days later by the addition of ³H-thymidine (1 µCi/well) 18 hr
prior to harvesting the cells. Cells are harvested onto glass filters and ³H-thymidine
incorporation is measured in a beta plate counter. These results demonstrate methods for
assaying T helper cell activity by measuring ³H-thymidine incorporation.

EXAMPLE III: Assay for Cytotoxic T Lymphocyte Response

This example shows a method for assaying cytotoxic T lymphocyte (CTL)
activity. A CTL response is measured essentially as described previously (Vitiello et al.,
Eur. J. Immunol. 27:671-678 (1997), which is incorporated herein by reference). Briefly,
after approximately 10-35 days following DNA immunization, splenocytes from an
animal are isolated and co-cultured at 37°C with syngeneic, irradiated (3000 rad)
peptide-coated LPS blasts (1 x 10* to 1.5 x 10* cells/ml) in 10 ml R10 in T25 flasks. LPS blasts
are obtained by activating splenocytes (1 x 10* to 1.5 x 10* cells/ml) with 25 µg/ml
lipopolysaccharides (LPS) (Sigma cat. no. L-2387; St. Louis, MO) and 7 ng/ml dextran
sulfate (Pharmacia Biotech) in 30 ml R10 medium in T75 flasks for 3 days at 37°C. The
lymphoblasts are then resuspended at a concentration of 2.5 x 10⁷ to 3.0 x 10⁷/ml,
irradiated (3000 rad), and coated with the appropriate peptides (100 ng/ml) for 1 h at
37°C. Cells are washed once, resuspended in R10 medium at the desired concentration
and added to the responder cell preparation. Cultures are assayed for cytolytic activity on
day 7 in a ⁵¹Cr-release assay.

For the ⁵¹Cr-release assay, target cells are labeled for 90 min at 37°C with
150 µl sodium ⁵¹chromate (⁵¹Cr) (New England Nuclear; Wilmington, DE), washed three
times and resuspended at the appropriate concentration in R10 medium. For the assay,
10⁴ target cells are incubated in the presence of different concentrations of effector cells
in a final volume of 200 µl in U-bottom 96-well plates in the presence or absence of 10
µg/ml peptide. Supernatants are removed after 6 h at 37°C, and the percent specific lysis
is determined by the formula: percent specific lysis = 100 x (experimental release -
spontaneous release) / (maximum release - spontaneous release). To facilitate comparison
of responses from different experiments, the percent release data are transformed to lytic
units 30 per 10⁶ cells (LU30/10⁶), with 1 LU30 defined as the number of effector cells
required to induce 30% lysis of 10⁴ target cells in a 6 h assay. LU values represent the
LU30/10⁶ obtained in the presence of peptide minus the LU30/10⁶ obtained in the absence
of peptide. These results demonstrate methods for assaying CTL activity by measuring
⁵¹Cr release from cells.
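
The percent specific lysis formula and the lytic unit conversion described above can be sketched as follows. This is a minimal illustration, not the procedure used to generate the reported data: the function names are hypothetical, and the log-linear interpolation used to locate the 30% lysis point is an assumption, since the text does not state how that point was interpolated.

```python
import math


def percent_specific_lysis(experimental, spontaneous, maximum):
    """100 x (experimental - spontaneous) / (maximum - spontaneous)."""
    if maximum <= spontaneous:
        raise ValueError("maximum release must exceed spontaneous release")
    return 100.0 * (experimental - spontaneous) / (maximum - spontaneous)


def lu30_per_million(effector_counts, lysis_percent, threshold=30.0):
    """Lytic units (LU30) per 10^6 effector cells.

    effector_counts: effector cells per well, each tested against 10^4 targets.
    lysis_percent:   percent specific lysis measured at each effector count.
    Assumption: the effector number giving `threshold` percent lysis is found
    by linear interpolation on log10(effector count).
    Returns None when the threshold is not bracketed by the data.
    """
    points = sorted(zip(effector_counts, lysis_percent))
    for (e_lo, l_lo), (e_hi, l_hi) in zip(points, points[1:]):
        if l_lo < threshold <= l_hi:
            frac = (threshold - l_lo) / (l_hi - l_lo)
            log_e = math.log10(e_lo) + frac * (math.log10(e_hi) - math.log10(e_lo))
            return 1e6 / (10 ** log_e)
    return None


def specific_lu(lu_with_peptide, lu_without_peptide):
    """LU obtained with peptide minus LU obtained without peptide."""
    return lu_with_peptide - lu_without_peptide
```

For example, with hypothetical counts of 1200 cpm experimental, 200 cpm spontaneous, and 2200 cpm maximum release, `percent_specific_lysis` gives 50% specific lysis.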

EXAMPLE IV: T Cell Proliferation in Mice Immunized with Expression Vectors
Encoding MHC Class II Epitopes and MHC Class II Targeting Sequences

This example demonstrates that expression vectors encoding MHC class II 
epitopes and MHC class II targeting sequences are effective at activating T cells. 

The constructs used in the T cell proliferation assay are described in 
Example I and were cloned into the vector pEP2, a CMV driven expression vector. The 
peptides used for T cell in vitro stimulation are: Ova 323-339, ISQAVHAAHAEINEAGR 
(SEQ ID NO: 123); HBVcore128, TPPAYRPPNAPILF (SEQ ID NO: 124); HBVenv182,
FFLLTRILTIPQSLD (SEQ ID NO: 125); and PADRE, AKFVAAWTLKAAA (SEQ ID 
NO:38). 

T cell proliferation was assayed essentially as described in Example II. 
Briefly, 12 to 16 week old B6D2 F1 mice (2 mice per construct) were injected with 100
µg of the indicated expression vector (50 µg per leg) in the anterior tibialis muscle. After
eleven days, spleens were collected from the mice and separated into a single cell
suspension by Dounce homogenization. The splenocytes were counted and one million
splenocytes were plated per well in a 96-well plate. Each sample was done in triplicate.
Ten µg/ml of the corresponding peptide encoded by the respective expression vectors was
added to each well. One well contained splenocytes without peptide added for a negative
control. Cells were cultured at 37°C, 5% CO2 for three days.

After three days, one µCi of ³H-thymidine was added to each well. After
18 hours at 37°C, the cells were harvested onto glass filters and ³H incorporation was
measured on an LKB β-plate counter. The results of the T cell proliferation assay are
shown in Table 9. Antigen-specific T cell proliferation is presented as the stimulation
index (SI); this is defined as the ratio of the average ³H-thymidine incorporation in the
presence of antigen divided by the ³H-thymidine incorporation in the absence of antigen.
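
The stimulation index defined above is a simple ratio, sketched here for clarity. The function name and the counts in the usage note are hypothetical; the function accepts a list for the no-antigen condition only so that a single control well or several can be passed in the same way.

```python
from statistics import mean


def stimulation_index(cpm_with_antigen, cpm_without_antigen):
    """Average 3H-thymidine incorporation with antigen divided by the
    incorporation measured without antigen (both given as lists of cpm)."""
    background = mean(cpm_without_antigen)
    if background <= 0:
        raise ValueError("background incorporation must be positive")
    return mean(cpm_with_antigen) / background
```

With illustrative triplicate counts of 3000, 3300, and 2700 cpm in the presence of peptide and a single control well at 1000 cpm, the SI is 3.0.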

The immunogen "PADRE + IFA" is a positive control where the PADRE
peptide in incomplete Freund's adjuvant was injected into the mice and compared to the
response seen by injecting the MHC class II epitope constructs containing a PADRE
sequence. As shown in Table 9, most of the expression vectors tested were effective at
activating T cell proliferation in response to the addition of PADRE peptide. The activity
of several of the expression vectors was comparable to that seen with immunization with
the PADRE peptide in incomplete Freund's adjuvant. The expression vectors containing
both MHC class I and MHC class II epitopes, pEP2-AOS and pcDNA-AOS, were also
effective at activating T cell proliferation in response to the addition of PADRE peptide.

These results show that expression vectors encoding MHC class II
epitopes fused to an MHC class II targeting sequence are effective at activating T cell
proliferation and are useful for stimulating an immune response.

EXAMPLE V: In vivo Assay Using Transgenic Mice
A. Materials and methods

Peptides were synthesized according to standard F-moc solid phase
synthesis methods which have been previously described (Ruppert et al., Cell 74:929
(1993); Sette et al., Mol. Immunol. 31:813 (1994)). Peptide purity was determined by
analytical reverse-phase HPLC and was routinely >95%. Synthesis and
purification of the Theradigm-HBV lipopeptide vaccine are described in Vitiello et al., J.
Clin. Invest. 95:341 (1995).

Mice

HLA-A2.1 transgenic mice used in this study were the F1 generation
derived by crossing transgenic mice expressing a chimeric gene consisting of the α1, α2
domains of HLA-A2.1 and α3 domain of H-2Kᵇ with SJL/J mice (Jackson Laboratory,
Bar Harbor, ME). This strain will be referred to hereafter as HLA-A2.1/Kᵇ-H-2ᵇˣˢ. The
parental HLA-A2.1/Kᵇ transgenic strain was generated on a C57BL/6 background using
the transgene and methods described in Vitiello et al., J. Exp. Med. 173:1007 (1991).
HLA-A11/Kᵇ transgenic mice used in the current study were identical to those described
in Alexander et al., J. Immunol. 159:4753 (1997).

Cell lines, MHC purification, and peptide binding assay
Target cells for peptide-specific cytotoxicity assays were Jurkat cells
transfected with the HLA-A2.1/Kᵇ chimeric gene (Vitiello et al., J. Exp. Med. 173:1007
(1991)) and .221 tumor cells transfected with HLA-A11/Kᵇ (Alexander et al., J. Immunol.
159:4753 (1997)).

To measure presentation of endogenously processed epitopes, Jurkat-A2.1/Kᵇ
cells were transfected with the pMin.1 or pMin.2-GFP minigenes and then tested in a
cytotoxicity assay against epitope-specific CTL lines. For transfection, Jurkat-A2.1/Kᵇ
cells were resuspended at 10^ cells/ml and 30 ng of DNA was added to 600 µl of cell
suspension. After electroporating cells in a 0.4 cm cuvette at 0.25 kV, 960 µFd, cells
were incubated on ice for 10 min then cultured for 2 d in RPMI culture medium. Cells
were then cultured in medium containing 200 U/ml hygromycin B (Calbiochem, San
Diego, CA) to select for stable transfectants. FACS was used to enrich the fraction of
green fluorescent protein (GFP)-expressing cells from 15% to 60% (data not shown).

Methods for measuring the quantitative binding of peptides to purified
HLA-A2.1 and -A11 molecules are described in Ruppert et al., Cell 74:929 (1993); Sette et
al., Mol. Immunol. 31:813 (1994); and Alexander et al., J. Immunol. 159:4753 (1997).

All tumor cell lines and splenic CTLs from primed mice were grown in 
culture medium (CM) that consisted of RPMI 1640 medium with Hepes (Life 
Technologies, Grand Island, NY) supplemented with 10% FBS, 4 mM L-glutamine, 5 x
10⁻⁵ M 2-ME, 0.5 mM sodium pyruvate, 100 µg/ml streptomycin, and 100 U/ml
penicillin. 

Construction of minigene multi-epitope DNA plasmids
pMIN.0 and pMIN.1 (i.e., pMIN-AOS) were constructed as described
above and in USSN 60/085,751.

pMin.1-No PADRE and pMin.1-Anchor. pMin.1 was amplified as two
overlapping fragments, which were then combined to yield the full length product. The
first reaction used the 5' pcDNA vector primer T7 and either primer
ATCGCTAGGCAGGAACTTATACAGGATTCC (SEQ ID NO:126) for pMin.1-No
PADRE or TGGACAGTCCGGCTCCCAGCACCACGT (SEQ ID NO:127) for pMin.1-
Anchor. The 3' half was amplified with the primers TTCCTGCCTAGCGATTTC (SEQ
ID NO:128) (No PADRE) or GCTGGGAGCCGGACTGTCCAGGTACGT (SEQ ID
NO:129) (Anchor) and Min-StopR. The two fragments generated from amplifying the 5'
and 3' ends were gel purified, mixed, denatured, annealed, and filled in with five cycles
of PCR. The full length fragment was further amplified with the flanking primers T7 and
Min-StopR for 25 more cycles.
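
For the two fragments to anneal and be filled in, the antisense primer of the 5' reaction and the sense primer of the 3' reaction must share a complementary region. The sketch below checks that overlap for the primer pairs as printed above; the helper names are hypothetical, and the sequences are taken from the text as transcribed, so any transcription error in a primer would change the result.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def overlap_length(antisense_primer, sense_primer):
    """Length of the longest 5' stretch of the antisense primer that is the
    exact reverse complement of the 5' stretch of the sense primer."""
    longest = min(len(antisense_primer), len(sense_primer))
    for k in range(longest, 0, -1):
        if antisense_primer[:k] == reverse_complement(sense_primer[:k]):
            return k
    return 0


# Primer pairs as printed in the text (SEQ ID NOS:126/128 and 127/129).
NO_PADRE = ("ATCGCTAGGCAGGAACTTATACAGGATTCC", "TTCCTGCCTAGCGATTTC")
ANCHOR = ("TGGACAGTCCGGCTCCCAGCACCACGT", "GCTGGGAGCCGGACTGTCCAGGTACGT")

no_padre_overlap = overlap_length(*NO_PADRE)
anchor_overlap = overlap_length(*ANCHOR)
```

On the sequences as printed, the No PADRE pair overlaps over 15 nucleotides and the Anchor pair over 20 nucleotides, which is the region through which the two gel-purified fragments prime each other during the fill-in cycles.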

pMin.1-No Sig. The Ig signal sequence was deleted from pMin.1 by PCR
amplification with primer GCTAGCGCCGCCACCATGCACACCCTGTGGAAGGCCGGAATC
(SEQ ID NO:130) and pcDNA rev (Invitrogen) primers. The product was
cloned into pCR-blunt and sequenced.

pMin.1-Switch. Three overlapping fragments were amplified from
pMin.1, combined, and extended. The 5' fragment was amplified with the vector primer
T7 and primer GGGCACCAGCAGGCTCAGCCACACTCCCAGCACCACGTC (SEQ
ID NO:131). The second overlapping fragment was amplified with primers
AGCCTGCTGGTGCCCTTTGTGATCCTGAAGGAGCCTGTGC (SEQ ID NO:132)
and AGCCACGTACCTGGACAGTCCCTTCCACACAGGCACTCCAT (SEQ ID
NO:133). Primer TGTCCAGGTACGTGGCTAGGCTGTGAGGTACC (SEQ ID
NO:134) and the vector primer pcDNA rev (Invitrogen) were used to amplify the third
(3') fragment. Fragments 1, 2, and 3 were amplified and gel purified. Fragments 2 and 3
were mixed, annealed, amplified, and gel purified. Fragment 1 was combined with the
product of 2 and 3, and extended, gel purified and cloned into pcDNA3.1 for expression.

pMin.2-GFP. The signal sequence was deleted from pMin.0 by PCR
amplification with the Min.0-No Sig-5' primer
GCTAGCGCCGCCACCATGCACACCCTGTGGAAGGCCGGAATC (SEQ ID
NO:135) plus the pcDNA rev (Invitrogen) primer. The product was cloned into pCR-blunt
and sequenced. The insert containing
the open reading frame of the signal sequence-deleted multi-epitope construct was cut out
with NheI plus HindIII and ligated into the same sites of pEGFPN1 (Clontech). This
construct fuses the coding region of the signal-deleted pMin.0 construct to the N-terminus
of green fluorescent protein (GFP).

Immunization of mice 

For DNA immunization, mice were pretreated by injecting 50 µl of 10 µM
cardiotoxin (Sigma Chem. Co., #C9759) bilaterally into the tibialis anterior muscle. Four
or five days later, 100 ng of DNA diluted in PBS were injected in the same muscle.
Theradigm-HBV lipopeptide (10 mg/ml in DMSO) that was stored at -20°C
was thawed for 10 min at 45°C before being diluted 1:10 (v/v) with room
temperature PBS. Immediately upon addition of PBS, the lipopeptide suspension was
vortexed vigorously and 100 µl was injected s.c. at the tail base (100 µg/mouse).

Immunogenicity of individual CTL epitopes was tested by mixing each
CTL epitope (50 ng/mouse) with the HBV core 128-140 peptide (TPPAYRPPNAPIL
(SEQ ID NO:124), 140 µg/mouse) which served to induce I-Aᵇ-restricted Th cells. The
peptide cocktail was then emulsified in incomplete Freund's adjuvant (Sigma Chem. Co.)
and 100 nl of peptide-emulsion was injected s.c. at the tail base.



In vitro CTL cultures and cytotoxicity assays

Eleven to 14 days after immunization, animals were sacrificed and a single
cell suspension of splenocytes prepared. Splenocytes from cDNA-primed animals were
stimulated in vitro with each of the peptide epitopes represented in the minigene.
Splenocytes (2.5-3.0 X lO'/flask) were cultured in upright 25 cm² flasks in the presence
of 10 µg/ml peptide and lO' irradiated spleen cells that had been activated for 3 days with
LPS (25 µg/ml) and dextran sulfate (7 ng/ml). Triplicate cultures were stimulated with
each epitope. Five days later, cultures were fed with fresh CM. After 10 d of in vitro
culture, 2-4 X 10^ CTLs from each flask were restimulated with 10^ LPS/dextran sulfate-
activated splenocytes treated with 100 ng/ml peptide for 60-75 min at 37°C, then
irradiated 3500 rads. CTLs were restimulated in 6-well plates in 8 ml of cytokine-free
CM. Eighteen hr later, cultures received cytokines contained in con A-activated
splenocyte supernatant (10-15% final concentration, v/v) and were fed or expanded on the
third day with CM containing 10-15% cytokine supernate. Five days after restimulation,
CTL activity of each culture was measured by incubating varying numbers of CTLs with
10⁴ ⁵¹Cr-labeled target cells in the presence or absence of peptide. To decrease
nonspecific cytotoxicity from NK cells, YAC-1 cells (ATCC) were also added at a
YAC-1:⁵¹Cr-labeled target cell ratio of 20:1. CTL activity against the HBV Pol 551 epitope
was measured by stimulating DNA-primed splenocytes in vitro with the native A-
containing peptide and testing for cytotoxic activity against the same peptide.

To more readily compare responses, the standard E:T ratio vs %
cytotoxicity data curves were converted into LU per 10⁶ effector cells, with one LU
defined as the lytic activity required to achieve 30% lysis of target cells at a 100:1 E:T
ratio. Specific CTL activity (ΔLU) was calculated by subtracting the LU value obtained
in the absence of peptide from the LU value obtained with peptide. A given culture was
scored positive for CTL induction if all of the following criteria were met: 1) ΔLU > 2; 2)
LU(+ peptide) ÷ LU(- peptide) > 3; and 3) a >10% difference in % cytotoxicity tested
with and without peptide at the two highest E:T ratios (starting E:T ratios were routinely
between 25-50:1).
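
The three scoring criteria can be read as a single predicate, sketched below. The function and argument names are hypothetical. The treatment of a culture with zero LU in the absence of peptide is an assumption, since the text does not say how the ratio criterion was applied in that case; here such a culture passes the ratio test whenever LU with peptide is positive.

```python
def is_ctl_positive(lu_with, lu_without, lysis_with, lysis_without):
    """Score a culture for CTL induction.

    lu_with, lu_without:       LU measured with and without peptide.
    lysis_with, lysis_without: % cytotoxicity at the two highest E:T ratios,
                               with and without peptide, in matching order.
    """
    # Criterion 1: specific activity (LU with peptide minus LU without) > 2.
    if lu_with - lu_without <= 2:
        return False
    # Criterion 2: LU(+ peptide) / LU(- peptide) > 3.
    if lu_without > 0:
        if lu_with / lu_without <= 3:
            return False
    elif lu_with <= 0:  # assumption: undefined ratio with no activity fails
        return False
    # Criterion 3: >10% difference in % cytotoxicity at both E:T ratios.
    return all(w - wo > 10 for w, wo in zip(lysis_with, lysis_without))
```

All three conditions must hold, so a culture with a large difference in LU but a poor signal-to-background ratio is still scored negative.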

CTL lines were generated from pMin.1-primed splenocytes through
repeated weekly stimulations of CTLs with peptide-treated LPS/DxS-activated 
splenocytes using the 6-well culture conditions described above with the exception that 
CTLs were expanded in cytokine-containing CM as necessary during the seven day 
stimulation period. 

Cytokine assay 

To measure IFN-γ production in response to minigene-transfected target
cells, 4 x 10* CTLs were cultured with an equivalent number of minigene-transfected
Jurkat-A2.1/Kᵇ cells in 96-well flat bottom plates. After overnight incubation at 37°C,
culture supernatant from each well was collected and assayed for IFN-γ concentration
using a sandwich ELISA. Immulon II microtiter wells (Dynatech, Boston, MA) were
coated overnight at 4°C with 0.2 µg of anti-mouse IFN-γ capture Ab, R4-6A2
(Pharmingen). After washing wells with PBS/0.1% Tween-20 and blocking with 1%
BSA, Ab-coated wells were incubated with culture supernate samples for 2 hr at room
temperature. A secondary anti-IFN-γ Ab, XMG1.2 (Pharmingen), was added to wells and
allowed to incubate for 2 hr at room temperature. Wells were then developed by
incubations with Avidin-DH and finally with biotinylated horseradish peroxidase H
(Vectastain ABC kit, Vector Labs, Burlingame, CA) and TMB peroxidase substrate
(Kirkegaard and Perry Labs, Gaithersburg, MD). The amount of cytokine present in each
sample was calculated using a rIFN-γ standard (Pharmingen).

B. Results

Selection of epitopes and minigene construct design
In the first series of experiments, the issue was whether a balanced
multispecific CTL response could be induced by simple minigene cDNA constructs that
encode several dominant HLA class I-restricted epitopes. Accordingly, nine CTL
epitopes were chosen on the basis of their relevance in CTL immunity during HBV and
HIV infection in humans, their sequence conservancy among viral subtypes, and their
class I MHC binding affinity (Table 10). Of these nine epitopes, six are restricted by
HLA-A2.1 and three showed HLA-A11 restriction. One epitope, HBV Pol 551, was
studied in two alternative forms: either the wild type sequence or an analog (HBV Pol
551-V) engineered for higher binding affinity.

As referenced in Table 10, several independent laboratories have reported
that these epitopes are part of the dominant CTL response during HBV or HIV infection.
All of the epitopes considered showed greater than 75% conservancy in primary amino
acid sequence among the different HBV subtypes and HIV clades. The MHC binding
affinity of the peptides was also considered in selection of the epitopes. These
experiments addressed the feasibility of immunizing with epitopes possessing a wide range
of affinities and, as shown in Table 10, the six HBV and three HIV HLA-restricted
epitopes covered a spectrum of MHC binding affinities spanning over two orders of
magnitude, with IC50% concentrations ranging from 3 nM to 200 nM.

The immunogenicity of the six A2.1- and three A11-restricted CTL
epitopes in transgenic mice was verified by co-immunization with a helper T cell peptide
in an IFA formulation. All of the epitopes induced significant CTL responses in the 5 to
73 ΔLU range (Table 10). As mentioned above, to improve the MHC binding and
immunogenicity of HBV Pol 551, the C-terminal A residue of this epitope was substituted
with V, resulting in a dramatic 40-fold increase in binding affinity to HLA-A2.1 (Table
10). While the parental sequence was weakly or nonimmunogenic in HLA transgenic
mice, the HBV Pol 551-V analog induced significant levels of CTL activity when
administered in IFA (Table 10). On the basis of these results, the V analog of the HBV
Pol 551 epitope was selected for the initial minigene construct. In all of the experiments
reported herein, CTL responses were measured with target cells coated with the native
HBV Pol 551 epitope, irrespective of whether the V analog or native epitope was utilized
for immunization.

Finally, since previous studies indicated that induction of T cell help
significantly improved the magnitude and duration of CTL responses (Vitiello et al., J.
Clin. Invest. 95:341 (1995); Livingston et al., J. Immunol. 159:1383 (1997)), the universal
Th cell epitope PADRE was also incorporated into the minigene. PADRE has been
shown previously to have high MHC binding affinity to a wide range of mouse and
human MHC class II haplotypes (Alexander et al., Immunity 1:751 (1994)). In particular,
it has been previously shown that PADRE is highly immunogenic in H-2ᵇ mice that are
used in the current study (Alexander et al., Immunity 1:751 (1994)).

pMin.1, the prototype cDNA minigene construct encoding nine CTL
epitopes and PADRE, was synthesized and subcloned into the pcDNA3.1 vector. The
position of each of the nine epitopes in the minigene was optimized to avoid junctional
mouse H-2ᵇˣˢ and HLA-A2.1 class I MHC epitopes. The mouse Ig κ signal sequence was
also included at the 5' end of the construct to facilitate processing of the CTL epitopes in
the endoplasmic reticulum (ER) as reported by others (Anderson et al., J. Exp. Med.
174:489 (1991)). To avoid further conformational structure in the translated polypeptide
gene product that may affect processing of the CTL epitopes, a stop codon was
introduced at the 3' end of the minigene construct upstream of the coding region for c-
myc and poly-his epitopes in the pcDNA3.1 vector.

Immunogenicity of pMin.1 in transgenic mice

To assess the capacity of the pMin.1 minigene construct to induce CTLs in
vivo, HLA-A2.1/Kᵇ-H-2ᵇˣˢ transgenic mice were immunized intramuscularly with 100 ng
of naked cDNA. As a means of comparing the level of CTLs induced by cDNA
immunization, a control group of animals was also immunized with Theradigm-HBV, a
palmitoylated lipopeptide consisting of the HBV Core 18 CTL epitope linked to the
tetanus toxin 830-843 Th cell epitope.

Splenocytes from immunized animals were stimulated twice with each of
the peptide epitopes encoded in the minigene, then assayed for peptide-specific cytotoxic
activity in a ⁵¹Cr release assay. A representative panel of CTL responses of pMin.1-
primed splenocytes, shown in Figure 22, clearly indicates that significant levels of CTL
induction were generated by minigene immunization. The majority of the cultures
stimulated with the different epitopes exceeded 50% specific lysis of target cells at an E:T
ratio of 1:1. The results of four independent experiments, compiled in Table 11, indicate
that the pMin.1 construct is indeed highly immunogenic in HLA-A2.1/Kᵇ-H-2ᵇˣˢ
transgenic mice, inducing a broad CTL response directed against each of its six A2.1-
restricted epitopes.

To more conveniently compare levels of CTL induction among the
different epitopes, the % cytotoxicity values for each splenocyte culture were converted to
ΔLU and the mean ΔLU of CTL activity in positive cultures for each epitope was
determined (see Example V, materials and methods, for positive criteria). The data,
expressed in this manner in Table 11, confirm the breadth of CTL induction elicited by
pMin.1 immunization since extremely high CTL responses, ranging from 50 to 700
ΔLU, were observed against the six A2.1-restricted epitopes. More significantly, the
responses of several hundred ΔLU observed for five of the six epitopes approached or
exceeded that of the Theradigm-HBV lipopeptide, a vaccine formulation known for its
high CTL-inducing potency (Vitiello et al., J. Clin. Invest. 95:341 (1995); Livingston et
al., J. Immunol. 159:1383 (1997)). The HBV Env 335 epitope was the only epitope
showing a lower mean ΔLU response compared to lipopeptide (Table 11, 44 vs 349
ΔLU).

Processing of minigene epitopes by transfected cells
The decreased CTL response observed against HBV Env 335 was
somewhat unexpected since this epitope had good A2.1 binding affinity (IC50%, 5 nM)
and was also immunogenic when administered in IFA. The lower response may be due,
at least in part, to the inefficient processing of this epitope from the minigene polypeptide
by antigen presenting cells following in vivo cDNA immunization. To address this
possibility, Jurkat-A2.1/Kᵇ tumor cells were transfected with pMin.1 cDNA and the
presentation of the HBV Env 335 epitope by transfected cells was compared to more
immunogenic A2.1-restricted epitopes using specific CTL lines. Epitope presentation
was also studied using tumor cells transfected with a control cDNA construct, pMin.2-
GFP, that encoded a similar multi-epitope minigene fused with GFP which allows
detection of minigene expression in transfected cells by FACS.

Epitope presentation of the transfected Jurkat cells was analyzed using
specific CTL lines, with cytotoxicity or IFN-γ production serving as a read-out. It was
found that the levels of CTL response correlated directly with the in vivo immunogenicity
of the epitopes. Highly immunogenic epitopes in vivo, such as HBV Core 18, HIV Pol
476, and HBV Pol 455, were efficiently presented to CTL lines by pMin.1- or pMin.2-
GFP-transfected cells as measured by IFN-γ production (Figure 23A, >100 pg/ml for each
epitope) or cytotoxic activity (Figure 23C, >30% specific lysis). In contrast to these high
levels of in vitro activity, the stimulation of the HBV Env 335-specific CTL line against
both populations of transfected cells resulted in less than 12 pg/ml IFN-γ and 3% specific
lysis. Although the HBV Env 335-specific CTL line did not recognize the naturally
processed epitope efficiently, this line did show an equivalent response to peptide-loaded
target cells, as compared to CTL lines specific for the other epitopes (Figure 23B, D).
Collectively, these results suggest that a processing and/or presentation defect associated
with the HBV Env 335 epitope may contribute to its diminished immunogenicity in
vivo.

Effect of the helper T cell epitope PADRE on minigene immunogenicity

Having obtained a broad and balanced CTL response in transgenic mice
immunized with a minigene cDNA encoding multiple HLA-A2.1-restricted epitopes,
variables that could influence the immunogenicity of the prototype construct were
examined next. This type of analysis could lead to rational and rapid optimization of
future constructs. More specifically, a cDNA construct based on the pMin.1 prototype
was synthesized in which the PADRE epitope was deleted to examine the contribution of
T cell help in minigene immunogenicity (Figure 24A).

The results of the immunogenicity analysis indicated that deletion of the
PADRE Th cell epitope resulted in significant decreases in the frequency of specific CTL
precursors against four of the minigene epitopes (HBV Core 18, HIV Env 120, HBV Pol
455, and HBV Env 335) as indicated by the 17 to 50% CTL-positive cultures observed
against these epitopes compared to the 90-100% frequency in animals immunized with
the prototype pMin.1 construct (Figure 25). Moreover, for two of the epitopes, HBV
Core 18 and HIV Env 120, the magnitude of response in positive cultures induced by
pMin.1-No PADRE was 20- to 30-fold less than that of the pMin.1 construct (Figure
25A).

Effect of modulation of MHC binding affinity on epitope immunogenicity
Next a construct was synthesized in which the V anchor residue in HBV 
Pol 551 was replaced with alanine, the native residue, to address the effect of decreasing 
MHC binding on epitope immunogenicity (Figure 24B). 

Unlike deletion of the Th cell epitope, decreasing the MHC binding
capacity of the HBV Pol 551 epitope by 40-fold through modification of the anchor
residue did not appear to affect epitope immunogenicity (Figure 25B). The CTL response
against the HBV Pol 551 epitope, as well as to the other epitopes, measured either by LU
or frequency of CTL-positive cultures, was very similar between the constructs
containing the native A or improved V residue at the MHC binding anchor site. This
finding reinforces the notion that minimal epitope minigenes can efficiently deliver
epitopes of vastly different MHC binding affinities. Furthermore, this finding is
particularly relevant to enhancing epitope immunogenicity via different delivery methods,
especially in light of the fact that the wild type HBV Pol 551 epitope was essentially
nonimmunogenic when delivered in a less potent IFA emulsion.

Effect of the signal sequence on minigene construct immunogenicity
The signal sequence was deleted from the pMin.1 construct, thereby
preventing processing of the minigene polypeptide in the ER (Figure 24C). When the
immunogenicity of the pMin.1-No Sig construct was examined, an overall decrease in
response was found against four CTL epitopes. Two of these epitopes, HIV Env 120 and
HBV Env 335, showed a decrease in frequency of CTL-positive cultures compared to
pMin.1 while the remaining epitopes, HBV Pol 455 and HIV Pol 476, showed a 16-fold
(from 424 to 27 ΔLU) and 3-fold decrease (709 to 236 ΔLU) in magnitude of the mean
CTL response, respectively (Figure 25C). These findings suggest that allowing ER-
processing of some of the epitopes encoded in the pMin.1 prototype construct may
improve immunogenicity, as compared with constructs that allow only cytoplasmic
processing of the same panel of epitopes.

Effect of epitope rearrangement and creation of new junctional epitopes
In the final construct tested, the immunogenicity of the HBV Env 335
epitope was analyzed to determine whether it may be influenced by its position at the 3'
terminus of the minigene construct (Figure 24D). Thus, the position of the Env epitope in
the cDNA construct was switched with a more immunogenic epitope, HBV Pol 455,
located in the center of the minigene. It should be noted that this modification also
created two potentially new epitopes. As shown in Figure 25D, the transposition of the
two epitopes appeared to affect the immunogenicity of not only the transposed epitopes
but also more globally of other epitopes. Switching epitopes resulted in obliteration of
CTL induction against HBV Env 335 (no positive cultures detected out of six). The CTL
response induced by the terminal HBV Pol 455 epitope was also decreased but only
slightly (424 vs 78 mean ΔLU). In addition to the switched epitopes, CTL induction
against other epitopes in the pMin.1-Switch construct was also markedly reduced
compared to the prototype construct. For example, a CTL response was not observed
against the HIV Env 120 epitope and it was significantly diminished against the HBV
Core 18 (4 of 6 positive cultures, decrease in mean ΔLU from 306 to 52) and HIV Pol
476 (decrease in mean ΔLU from 709 to 20) epitopes (Figure 25D).

As previously mentioned, it should be noted that switching the two
epitopes had created new junctional epitopes. Indeed, in the pMin.1-Switch construct,
two new potential CTL epitopes were created from sequences of HBV Env 335-HIV Pol
476 (LLVPFVIL (SEQ ID NO:135), H-2Kᵇ-restricted) and HBV Env 335-HBV Pol 551
(VLGVWLSLLV (SEQ ID NO:136), HLA-A2.1-restricted) epitopes. Although these
junctional epitopes have not been examined to determine whether or not they are indeed
immunogenic, this may account for the low immunogenicity of the HBV Env 335 and
HIV Pol 476 epitopes. These findings suggest that avoiding junctional epitopes may be
important in designing multi-epitope minigenes as is the ability to confirm their
immunogenicity in vivo in a biological assay system such as HLA transgenic mice.
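
Candidate junctional epitopes of the kind described above can be listed mechanically by sliding a window across the boundary between two adjacent epitopes. The sketch below is illustrative only: the function name is hypothetical, and it enumerates peptides that span a junction without predicting whether any of them binds MHC or is immunogenic.

```python
def junction_peptides(left, right, length):
    """All peptides of the given length in left+right that contain at least
    one residue from each of the two adjacent epitopes."""
    joined = left + right
    first = max(0, len(left) - length + 1)          # earliest start that reaches `right`
    last = min(len(left) - 1, len(joined) - length)  # latest start still inside `left`
    return [joined[i:i + length] for i in range(first, last + 1)]


# HBV Env 335 (WLSLLVPFV) placed directly upstream of HIV Pol 476 (ILKEPVHGV),
# as in the Env 335-Pol 476 junction discussed in the text.
eight_mers = junction_peptides("WLSLLVPFV", "ILKEPVHGV", 8)
```

The resulting list of 8-mers includes LLVPFVIL, the H-2Kᵇ-restricted junctional sequence noted above. Each listed peptide would then need to be screened against the relevant MHC binding motifs.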

Induction of CTLs against A11 epitopes encoded in pMin.1
To further examine the flexibility of the minigene vaccine approach for
inducing a broad CTL response against not only multiple epitopes but also against
epitopes restricted by different HLA alleles, HLA-A11/Kᵇ transgenic mice were
immunized to determine whether the three A11 epitopes in the pMin.1 construct were
immunogenic for CTLs, as was the case for the A2.1-restricted epitopes in the same
construct. As summarized in Table 12, significant CTL induction was observed in a
majority of cultures against all three of the HLA-A11-restricted epitopes and the level of
CTL immunity induced for the three epitopes, in the range of 40 to 260 ΔLU, exceeded
that of peptides delivered in IFA (Table 10). Thus, nine CTL epitopes of varying HLA
restrictions incorporated into a prototype minigene construct all demonstrated significant
CTL induction in vivo, confirming that minigene DNA plasmids can serve as a means of
delivering multiple epitopes, of varying HLA restrictions and MHC binding affinities, to
the immune system in an immunogenic fashion and that appropriate transgenic mouse
strains can be used to measure DNA construct immunogenicity in vivo.

CTLs were also induced against three A11 epitopes in A11/Kᵇ transgenic
mice. These responses suggest that minigene delivery of multiple CTL epitopes that
confers broad population coverage may be possible in humans and that transgenic animals
of appropriate haplotypes may be useful tools in optimizing the in vivo immunogenicity
of minigene DNA. In addition, animals such as monkeys having conserved HLA
molecules with cross reactivity to CTL and HTL epitopes recognized by human MHC
molecules can be used to determine human immunogenicity of HTL and CTL epitopes
(Bertoni et al., J. Immunol. 161:4447-4455 (1998)).

This study represents the first description of the use of HLA transgenic
mice to quantitate the in vivo immunogenicity of DNA vaccines, by examining response
to epitopes restricted by human HLA antigens. In vivo studies are required to address the
variables crucial for vaccine development that are not easily evaluated by in vitro assays,
such as route of administration, vaccine formulation, tissue biodistribution, and
involvement of primary and secondary lymphoid organs. Because of their simplicity and
flexibility, HLA transgenic mice represent an attractive alternative, at least for initial
vaccine development studies, compared to more cumbersome and expensive studies in
higher animal species, such as nonhuman primates. The in vitro presentation studies
described above further support the use of HLA transgenic mice for screening DNA
constructs containing human epitopes inasmuch as a direct correlation between in vivo
immunogenicity and in vitro presentation was observed. Finally, strong CTL responses
were observed against all six A2.1-restricted viral epitopes and against three A11-restricted
epitopes encoded in the prototype pMin.1 construct. For five of the A2.1-restricted
epitopes, the magnitude of CTL response approximated that observed with the
lipopeptide, Theradigm-HBV, that previously was shown to induce strong CTL responses
in humans (Vitiello et al., J. Clin. Invest. 95:341 (1995); Livingston et al., J. Immunol.
159:1383 (1997)).




Table 9. Activation of T Cell Proliferation by Expression
Vectors Encoding MHC Class II Epitopes Fused to MHC
Class II Targeting Sequences

                                    Stimulating Peptide¹
Immunogen                  PADRE         OVA 323       CORE 128

peptide-CFA²               3.0 (1.1)     2.7 (1.2)     3.2 (1.4)
pEP2.(PAOS).(-)
pEP2.(AOS).(-)             5.6 (1.8)
pEP2.(PAOS).(sigTh)        5.0 (2.9)                   2.6 (1.5)
pEP2.(PAOS).(IgaTh)        5.6 (2.1)                   3.0 (1.6)
pEP2.(PAOS).(LampTh)       3.8 (1.7)                   3
pEP2.(PAOS).(IiTh)         5.2 (2.0)     3.2 (1.5)     3.7 (1.5)
pEP2.(PAOS).(H2M)          3.3 (1.3)                   2.8

¹ Geometric mean of cultures with SI ≥ 2.
² Proliferative response measured in the lymph node.
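
The first footnote to Table 9 restricts the geometric mean to cultures with a stimulation index (SI) of at least 2. A minimal Python sketch of that kind of summary follows. It assumes that SI is the ratio of counts per minute with peptide to counts per minute without peptide, and that the value in parentheses is a geometric (multiplicative) standard deviation; neither definition is spelled out in this section, and the counts used below are invented for illustration only.

```python
import math


def stimulation_index(cpm_with_peptide, cpm_without_peptide):
    """Assumed definition: proliferation with peptide divided by background."""
    return cpm_with_peptide / cpm_without_peptide


def geometric_mean_and_sd(values):
    """Return (geometric mean, geometric SD) for a list of positive numbers."""
    logs = [math.log(v) for v in values]
    mean_log = sum(logs) / len(logs)
    if len(logs) < 2:
        # A single value has no spread; a multiplicative SD of 1 means "none".
        return math.exp(mean_log), 1.0
    variance = sum((x - mean_log) ** 2 for x in logs) / (len(logs) - 1)
    return math.exp(mean_log), math.exp(math.sqrt(variance))


def summarize_cultures(si_values, threshold=2.0):
    """Summarize only the cultures whose SI reaches the threshold."""
    positive = [si for si in si_values if si >= threshold]
    if not positive:
        return None
    mean, sd = geometric_mean_and_sd(positive)
    return {"n_positive": len(positive), "geo_mean": mean, "geo_sd": sd}


# Invented (cpm with peptide, cpm without peptide) pairs, for illustration only.
cultures = [(4000, 1000), (9000, 1000), (1500, 1000)]
si_values = [stimulation_index(w, wo) for w, wo in cultures]
summary = summarize_cultures(si_values)
print(summary)
```

In this invented example the third culture (SI 1.5) is excluded, so the summary is computed from the two cultures with SI 4 and 9.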
Table 10

CTL Epitopes in cDNA Minigene

                                                       Immunogenicity In Vivo (IFA)
                                            MHC        No. CTL-    CTL Response
                               MHC          Binding    Positive    (Geo. Mean
Epitope          Sequence      Restrict.    Affinity   Cultures    x/÷ SD)ᵇ, ΔLU

HBV Core 18      FLPSDFFPSV    A2.1         3          6/6         73.0 (1.1)
HBV Env 335      WLSLLVPFV     A2.1         5          4/6         5.3 (1.6)
HBV Pol 455      GLSRYVARL     A2.1         76         NDᶜ         ND
HIV Env 120      KLTPLCVTL     A2.1         102        2/5         6.4 (1.3)
HIV Pol 476      ILKEPVHGV     A2.1         192        2/5         15.2 (2.9)
HBV Pol 551-A    YMDDVVLGA     A2.1         200        0/6
HBV Pol 551-V    YMDDVVLGV     A2.1         5          6/6         8.2 (2.3)
HIV Env 49       TVYYGVPVWK    A11          4          28/33       13.4 (3.1)
HBV Core 141     STLPETTVVRR   A11          4          6/6         12.1 (2.6)
HBV Pol 149      HTLWKAGILYK   A11          14         6/6         13.1 (1.2)

a Peptide tested in HLA-A2.1/Kᵇ H-2 transgenic mice by co-immunizing with a T helper cell peptide in IFA.
b Geometric mean CTL response of positive cultures.
c ND, not done.
Table 11

Summary of Immunogenicity of pMin.1 DNA
construct in HLA A2.1/Kᵇ transgenic mice

                                      CTL Responseᵃ
                     No. Positive        Geo. Mean Response Positive
Epitope              Cultures/Totalᵇ     Cultures [x/÷ SD]

HBV Core 18          9/9                 455.5 [2.2]
HIV Env 120          12/12               211.9 [3.7]
HBV Pol 551-V        9/9                 126.1 [2.8]
HBV Pol 455          12/12               738.6 [1.3]
HIV Pol 476          11/11               716.7 [1.5]
HBV Env 335          12/12               43.7 [1.8]
HBV Core 18          10/10               349.3 [1.8]
(Theradigm)ᶜ

a Mice were immunized with pMin.1 DNA or Theradigm-HBV lipopeptide and CTL
activity in splenocyte cultures was determined after in vitro stimulation with
individual peptide epitopes. Results from four independent experiments are shown.

b See Example V, Materials and Methods for definition of a CTL-positive culture.

c Response of mice immunized with Theradigm-HBV lipopeptide containing the HBV
Core 18 epitope.
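
The comparison drawn in the text between the pMin.1 responses and the Theradigm-HBV lipopeptide response can be made explicit from the geometric means in Table 11. The short Python sketch below simply divides each pMin.1 value by the lipopeptide value (349.3). The variable and function names are illustrative, and no cutoff for what counts as "approximating" the lipopeptide response is implied, since none is stated here.

```python
# Geometric mean CTL responses of positive cultures, copied from Table 11.
PMIN1_RESPONSE = {
    "HBV Core 18": 455.5,
    "HIV Env 120": 211.9,
    "HBV Pol 551-V": 126.1,
    "HBV Pol 455": 738.6,
    "HIV Pol 476": 716.7,
    "HBV Env 335": 43.7,
}
# Response to the HBV Core 18 epitope after Theradigm-HBV immunization.
THERADIGM_RESPONSE = 349.3


def fold_of_reference(responses, reference):
    """Express each response as a multiple of the reference response."""
    return {epitope: value / reference for epitope, value in responses.items()}


ratios = fold_of_reference(PMIN1_RESPONSE, THERADIGM_RESPONSE)
for epitope, ratio in sorted(ratios.items(), key=lambda item: item[1], reverse=True):
    print(f"{epitope:<14} {ratio:5.2f} x lipopeptide response")
```

Because the tabulated values are geometric means with multiplicative standard deviations, ratios of this kind are a more natural comparison than differences.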
Table 12

Summary of immunogenicity
in HLA A11/Kᵇ transgenic mice

                                      CTL Responseᵃ (ΔLU)
                     No. Positive        Geo. Mean Response
Epitope              Cultures/Totalᵇ     Positive Cultures [x/÷ SD]

HBV Core 141         5/9                 128.1 [1.6]
HBV Pol 149          6/9                 267.1 [2.2]
HIV Env 43           9/9                 40.1 [2.9]

a Mice were immunized with pMin.1 DNA and CTL activity in splenocyte cultures was
determined after in vitro stimulation with individual A11-restricted epitopes. The
geometric mean CTL response from three independent experiments is shown.

b Definition of a CTL-positive culture is described in Example V, Materials and
Methods.
1. An expression vector comprising a promoter operably linked to a
first nucleotide sequence encoding a major histocompatibility (MHC) targeting sequence
fused to a second nucleotide sequence encoding two or more heterologous peptide
epitopes, wherein the heterologous peptide epitopes comprise two HTL peptide epitopes
or a CTL peptide epitope and a universal HTL peptide epitope.

2. The expression vector of claim 1, wherein the heterologous peptide
epitopes comprise two or more heterologous HTL peptide epitopes.

3. The expression vector of claim 1, wherein the heterologous peptide
epitopes comprise a CTL peptide epitope and a universal HTL peptide epitope.

4. The expression vector of claim 2, wherein the heterologous peptide
epitopes further comprise one or more CTL peptide epitopes.

5. The expression vector of claim 3, wherein the heterologous peptide
epitopes further comprise two or more CTL peptide epitopes.

6. The expression vector of claim 3, wherein the heterologous peptide
epitopes further comprise two or more HTL peptide epitopes.

7. The expression vector of claim 2, wherein one of the HTL peptide
epitopes is a universal HTL epitope.

8. The expression vector of claim 3 or 7, wherein the universal HTL
epitope is a pan DR epitope.

9. The expression vector of claim 8, wherein the pan DR epitope has
the sequence AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38).

10. The expression vector of claim 1, wherein the peptide epitopes are
hepatitis B virus epitopes, hepatitis C virus epitopes, human immunodeficiency virus
epitopes, human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes,
PAP epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes.

11. The expression vector of claim 10, wherein the peptide epitopes
each have a sequence selected from the group consisting of the peptides depicted in
Tables 1-8.

12. The expression vector of claim 11, wherein at least one of the
peptide epitopes is an analog of a peptide depicted in Tables 1-8.

13. The expression vector of claim 1, wherein the MHC targeting
sequence comprises a region of a polypeptide selected from the group consisting of the Ii
protein, LAMP-I, HLA-DM, HLA-DO, H2-DO, influenza matrix protein, hepatitis B
surface antigen, hepatitis B virus core antigen, Ty particle, Ig-α protein, Ig-β protein, and
Ig kappa chain signal sequence.

14. The expression vector of claim 1, wherein the expression vector
further comprises a second promoter sequence operably linked to a third nucleotide
sequence encoding one or more heterologous HTL or CTL peptide epitopes.

15. The expression vector of claim 1, wherein the vector comprises
pMin.1 or pEP2.

16. The expression vector of claim 3 or 4, wherein the CTL peptide
epitope comprises a structural motif for an HLA supertype, whereby the peptide CTL
epitope binds to two or more members of the supertype with an affinity of greater than
500 nM.

17. The expression vector of claim 4 or 5, wherein the CTL peptide
epitopes have structural motifs that provide binding affinity for more than one HLA allele
supertype.

18. A method of inducing an immune response in vivo comprising
administering to a mammalian subject an expression vector comprising a promoter
operably linked to a first nucleotide sequence encoding a major histocompatibility (MHC)
targeting sequence fused to a second nucleotide sequence encoding two or more
heterologous peptide epitopes, wherein the heterologous peptide epitopes comprise two
HTL peptide epitopes or a CTL peptide epitope and a universal HTL peptide epitope.

19. The method of claim 18, wherein the heterologous peptide epitopes
comprise two or more heterologous HTL peptide epitopes.

20. The method of claim 18, wherein the heterologous peptide epitopes
comprise a CTL peptide epitope and a universal HTL peptide epitope.

21. The method of claim 19, wherein the heterologous peptide epitopes
further comprise one or more CTL peptide epitopes.

22. The method of claim 20, wherein the heterologous peptide epitopes
further comprise two or more CTL peptide epitopes.

23. The method of claim 20, wherein the heterologous peptide epitopes
further comprise two or more HTL peptide epitopes.

24. The method of claim 19, wherein the HTL peptide epitope is a
universal HTL epitope.

25. The method of claim 20 or 24, wherein the universal HTL epitope
is a pan DR epitope.

26. The method of claim 25, wherein the pan DR epitope has the
sequence AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38).

27. The method of claim 18, wherein the peptide epitopes are hepatitis
B virus epitopes, hepatitis C virus epitopes, human immunodeficiency virus epitopes,
human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PAP epitopes, PSM
epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes.

28. The method of claim 27, wherein the peptide epitopes each have a
sequence selected from the group consisting of the peptides depicted in Tables 1-8.

29. The method of claim 28, wherein at least one of the peptide epitopes
is an analog of a peptide depicted in Tables 1-8.

30. The method of claim 18, wherein the MHC targeting sequence
comprises a region of a polypeptide selected from the group consisting of the Ii protein,
LAMP-I, HLA-DM, HLA-DO, H2-DO, influenza matrix protein, hepatitis B surface
antigen, hepatitis B virus core antigen, Ty particle, Ig-α protein, Ig-β protein, and Ig
kappa chain signal sequence.

31. The method of claim 18, wherein the expression vector further
comprises a second promoter sequence operably linked to a third nucleotide sequence
encoding one or more heterologous HTL or CTL peptide epitopes.

32. The method of claim 18, wherein the vector comprises pMin.1 or
pEP2.

33. The method of claim 20 or 21, wherein the CTL peptide epitope
comprises a structural motif for an HLA supertype, whereby the peptide epitope binds to
two or more members of the supertype with an affinity of greater than 500 nM.

34. The method of claim 21 or 22, wherein the CTL peptide epitopes
have structural motifs that provide binding affinity for more than one HLA allele
supertype.

35. A method of inducing an immune response in vivo comprising
administering to a mammalian subject an expression vector comprising a promoter
operably linked to a first nucleotide sequence encoding a major histocompatibility (MHC)
targeting sequence fused to a second nucleotide sequence encoding a heterologous human
HTL peptide epitope.

36. The method of claim 35, wherein the second nucleotide sequence
further comprises two or more heterologous HTL peptide epitopes.

37. The method of claim 35, wherein the second nucleotide sequence
further comprises one or more heterologous CTL peptide epitopes.

38. The method of claim 35, wherein the HTL peptide epitope is a
universal HTL peptide epitope.

39. The method of claim 38, wherein the universal HTL epitope is a
pan DR epitope.

40. The method of claim 39, wherein the pan DR epitope has the
sequence AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38).

41. The method of claim 37, wherein the HTL and CTL peptide
epitopes are hepatitis B virus epitopes, hepatitis C virus epitopes, human
immunodeficiency virus epitopes, human papilloma virus epitopes, MAGE epitopes, PSA
epitopes, PAP epitopes, PSM epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes,
or Plasmodium epitopes.

42. The method of claim 41, wherein the peptide epitopes each have a
sequence selected from the group consisting of the peptides depicted in Tables 1-8.

43. The method of claim 42, wherein at least one of the peptide
epitopes is an analog of a peptide depicted in Tables 1-8.

44. The method of claim 35, wherein the MHC targeting sequence
comprises a region of a polypeptide selected from the group consisting of the Ii protein,
LAMP-I, HLA-DM, HLA-DO, H2-DO, influenza matrix protein, hepatitis B surface
antigen, hepatitis B virus core antigen, Ty particle, Ig-α protein, Ig-β protein, and Ig
kappa chain signal sequence.

45. The method of claim 35, wherein the expression vector further
comprises a second promoter sequence operably linked to a third nucleotide sequence
encoding one or more heterologous HTL or CTL peptide epitopes.

46. The method of claim 37, wherein the CTL peptide epitope
comprises a structural motif for an HLA supertype, whereby the peptide epitope binds to
two or more members of the supertype with an affinity of greater than 500 nM.

47. The method of claim 37, wherein the CTL peptide epitopes have
structural motifs that provide binding affinity for more than one HLA allele supertype.

48. A method of assaying the human immunogenicity of a human T
cell peptide epitope in vivo in a non-human mammal, comprising the step of
administering to the non-human mammal an expression vector comprising a promoter
operably linked to a first nucleotide sequence encoding a heterologous human CTL or
HTL peptide epitope.

49. The method of claim 48, wherein the first nucleotide sequence
encodes two or more heterologous CTL or HTL peptide epitopes.

50. The method of claim 48, wherein the non-human mammal is a
transgenic mouse that expresses a human HLA allele.

51. The method of claim 50, wherein the human HLA allele is selected
from the group consisting of A11 and A2.1.

52. The method of claim 48, wherein the expression vector further
comprises a second nucleotide sequence encoding a major histocompatibility (MHC)
targeting sequence.

53. The method of claim 48, wherein the HTL peptide epitope is a
universal HTL epitope.

54. The method of claim 53, wherein the universal HTL epitope is a
pan DR epitope.

55. The method of claim 54, wherein the pan DR epitope has the
sequence AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38).

56. The method of claim 48, wherein the CTL or HTL peptide epitopes
are hepatitis B virus epitopes, hepatitis C virus epitopes, human immunodeficiency virus
epitopes, human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes,
PAP epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes.

57. The method of claim 56, wherein the CTL or HTL peptide epitopes
each have a sequence selected from the group consisting of the peptides depicted in
Tables 1-8.

58. The method of claim 57, wherein at least one of the peptide
epitopes is an analog of a peptide depicted in Tables 1-8.

59. The method of claim 52, wherein the MHC targeting sequence
comprises a region of a polypeptide selected from the group consisting of the Ii protein,
LAMP-I, HLA-DM, HLA-DO, H2-DO, influenza, hepatitis B virus core antigen, Ty
particle, Ig-α protein, Ig-β protein, and Ig kappa chain signal sequence.

60. The method of claim 48, wherein the expression vector further
comprises a second promoter sequence operably linked to a third nucleotide sequence
encoding one or more heterologous human CTL or HTL peptide epitopes.

61. The method of claim 48, wherein the vector comprises pMin.1 or
pEP2.

62. The method of claim 48, wherein the CTL peptide epitope has a
structural motif that provides binding affinity for an HLA allele supertype.

63. The method of claim 49, wherein the CTL peptide epitopes have
structural motifs that provide binding affinity for more than one HLA allele supertype.

64. The method of claim 48, wherein the expression vector comprises
both HTL and CTL peptide epitopes.
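
Claims 9, 26, 40 and 55 recite the pan DR epitope in concatenated three-letter amino acid code, whereas the sequence figures use one-letter code. As a convenience for comparing the two, a minimal Python sketch of the conversion follows. The function name is illustrative and only the twenty standard residues are handled.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def three_to_one(sequence):
    """Convert a run-together three-letter sequence to one-letter code."""
    if len(sequence) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    residues = [sequence[i:i + 3] for i in range(0, len(sequence), 3)]
    # An unknown residue name raises KeyError rather than being guessed.
    return "".join(THREE_TO_ONE[residue] for residue in residues)


pan_dr = three_to_one("AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla")
print(pan_dr)  # AKFVAAWTLKAAA
```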
[Nucleotide sequence of 669 base pairs, printed as both strands in rows of 70 with the
encoded amino acid sequence beneath each row. The amino acid rows read:]

MDDQRDLISNHEQLPILG>
NRPREPERCSRGALYTGVSVLVAL>
LLAGQATTAYFLYQQQGRLDKLT>
ITSQNLQLESLRMKLPKSAKPVA>
KFVAAWTLKAAAMSMDNMLLGPVK>
NVTKYGNMTQDHVMHLLTRSGPL>
EYPQLKGTFPENLKHLKNSMDGV>
NWKIFESWMKQWLLFEMSKNSLEE>
KKPTEAPPKEPLDMEDLSSGLGV>
TRQELGQVTL*>

FIGURE 1
        10        20        30        40        50        60        70
GCTAGCGCCGCCACCATGGATGACCAACGCGACCTCATCTCTAACCATGAGCAATTGCCCATACTGGGCA
CGATCGCGGCGGTGGTACCTACTGGTTGCGCTGGAGTAGAGATTGGTACTCGTTAACGGGTATGACCCGT
MDDQRDLISNHEQLPILG>

        80        90       100       110       120       130       140
ACCGCCCTAGAGAGCCAGAAAGGTGCAGCCGTGGAGCTCTGTACACCGGTGTTTCTGTCCTGGTGGCTCT
TGGCGGGATCTCTCGGTCTTTCCACGTCGGCACCTCGAGACATGTGGCCACAAAGACAGGACCACCGAGA
NRPREPERCSRGALYTGVSVLVAL>

       150       160       170       180       190       200       210
GCTCTTGGCTGGGCAGGCCACCACTGCTTACTTCCTGTACCAGCAACAGGGCCGCCTAGACAAGCTGACC
CGAGAACCGACCCGTCCGGTGGTGACGAATGAAGGACATGGTCGTTGTCCCGGCGGATCTGTTCGACTGG
LLAGQATTAYFLYQQQGRLDKLT>

       220       230       240       250       260       270       280
ATCACCTCCCAGAACCTGCAACTGGAGAGCCTTCGCATGAAGCTTATCAGCCAGGCTGTGCACGCCGCTC
TAGTGGAGGGTCTTGGACGTTGACCTCTCGGAAGCGTACTTCGAATAGTCGGTCCGACACGTGCGGCGAG
ITSQNLQLESLRMKLISQAVHAA>

       290       300       310       320       330       340       350
ACGCCGAAATCAACGAAGCTGGAAGAACCCCTCCAGCTTATCGCCCTCCAAACGCTCCTATCCTGTTCTT
TGCGGCTTTAGTTGCTTCGACCTTCTTGGGGAGGTCGAATAGCGGGAGGTTTGCGAGGATAGGACAAGAA
HAEINEAGRTPPAYRPPNAPILFF>

       360       370       380       390       400       410       420
TCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGTTCGTGGCTGCCTGGACCCTGAAG
AGACGACTGGTCTTAGGACTGTTAGGGGGTCAGGGACCTGCGGTTCAAGCACCGACGGACCTGGGACTTC
LLTRILTIPQSLDAKFVAAWTLK>

       430
GCTGCCGCTTGAGGTACC
CGACGGCGAACTCCATGG
AAA*>

FIGURE 2
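
Because the sequence figures print both strands together with the translation, one strand can be used to check the other. The Python sketch below is an illustration only: it takes the lower strand of positions 1 to 70 of FIGURE 2, derives the upper strand by base complementarity, and translates from the first ATG using the standard genetic code. The function names are assumptions introduced here, not part of the specification.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(strand):
    """Base-by-base complement (not reversed), matching the aligned figure rows."""
    return strand.translate(COMPLEMENT)


def translate_from_first_atg(coding_strand):
    """Translate complete codons from the first ATG until a stop codon."""
    start = coding_strand.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(coding_strand) - 2, 3):
        aa = CODON_TABLE[coding_strand[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Lower strand of positions 1-70, written left to right as in the figure row.
lower = "CGATCGCGGCGGTGGTACCTACTGGTTGCGCTGGAGTAGAGATTGGTACTCGTTAACGGGTATGACCCGT"
upper = complement(lower)
print(upper)
print(translate_from_first_atg(upper))
```

The lower strand is printed aligned under the upper strand, so a plain complement (rather than a reverse complement) recovers the upper strand in reading order.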
[Nucleotide sequence of 783 base pairs, printed as both strands in rows of 70 with the
encoded amino acid sequence beneath each row, continued over two pages. The amino acid
rows read:]

MDDQRDLISNHEQLPILG>
NRPREPERCSRGALYTGVSVLVAL>
LLAGQATTAYFLYQQQGRLDKLT>
ITSQNLQLESLRMKLISQAVHAA>
HAEINEAGRTPPAYRPPNAPILFF>
LLTRILTIPQSLDAKFVAAWTLK>
AAAMSMDNMLLGPVKNVTKYGNM>
TQDHVMHLLTRSGPLEYPQLKGTF>
PENLKHLKNSMDGVNWKIFESWM>
KQWLLFEMSKNSLEEKKPTEAPP>
KEPLDMEDLSSGLGVTRQELGQVT>
L*>

FIGURE 3
[Nucleotide sequence of 378 base pairs, printed as both strands in rows of 70 with the
encoded amino acid sequence beneath each row. The amino acid rows read:]

MGMQVQIQSLFLLLLWVP>
GSRGISQAVHAAHAEINEAGRTPP>
AYRPPNAPILFFLLTRILTIPQS>
LDAKFVAAWTLKAAANNMLIPIA>
VGGALAGLVLIVLIAYLIGRKRSH>
AGYQTI*>

FIGURE 4
[Sequence figure: nucleotide sequence of 381 base pairs, printed as both strands in rows
of 70 with the encoded amino acid sequence beneath each row. The amino acid rows read:]

MAALWLLLLVLSLHCMGI>
SQAVHAAHAEINEAGRTPPAYRPP>
NAPILFFLLTRILTIPQSLDAKF>
VAAWTLKAAAKVSVSAATLGLGF>
IIFCVGFFRWRKSHSSSYTPLPGS>
TYPEGRH*>
[Sequence figure: double-stranded nucleotide sequence with position ticks to 430 and the
encoded amino acid sequence beneath each row.]
[Sequence figure, two pages: double-stranded nucleotide sequence with position ticks to
810 and the encoded amino acid sequence beneath each row.]
[Double-stranded nucleotide sequence with position ticks to 800 and the encoded amino
acid sequence beneath each row, continued over two pages.]

FIGURE 8
[Sequence figure: double-stranded nucleotide sequence with position ticks to 510 and the
encoded amino acid sequence beneath each row.]
[Sequence figure: double-stranded nucleotide sequence with position ticks to 480 and the
encoded amino acid sequence beneath each row.]
[Sequence figure: nucleotide sequence of 264 base pairs, printed as both strands in rows
of 70 with the encoded amino acid sequence beneath each row. The amino acid rows read:]

MGMQVQIQSLFLLLLWVP>
GSRGISQAVHAAHAEINEAGRTPP>
AYRPPNAPILFFLLTRILTIPQS>
LDAKFVAAWTLKAAA*>
5 10



CCA GTC ATG GAT GAC CAG CGC GAC CTT ATC TCC AAC AAT GAG CAA CTG 
Val Met Asp ASP Gin Arg Asp Leu He Ser Asa Asn Glu Gin Leu 
IS 20 " 

CCC ATG CTG GGC CGG CGC CCT GGG GCC CCG GAG AGC AAG TGC AGC CGC 
p'o Leu Gly Ars Arg Pro Gly Ala Pro Glu Ser Lys Cys Ser Arg 
35 40 45 

GCA GCC CTG TAC ACA GGC TTT TCC ATC CTG GTG. ACT CTG CTC CTC GCT 
Ty Sa Leu Tyr Thr Gly Phe Ser He Leu Val Thr Leu Leu Leu Ala 
SO SS 60 

GGC CAG GCC ACC ACC GCC TAC TTC CTG TAC CAG CAG CAG GGC CGG CTG 
S-Sn Ala Thr Thr Ala Tyr Phe Leu Tyr Gin Gin Gin Gly. Arg Leu 
65 70 75 

«r.7> r-"- -rr T-C CAG AAC C-G CAG CTG GAG AAC CTG CGC 
GAC AAA CTG ACA Giw «CC Ts,C t-A^a AAi- c-^j v_rt« w 

ASD Lvs Leu Thr Val Thr Ser Gin Asn Leu Gin Leu Glu Asn Leu Arg 
" 80 « 

ATG AAG CTT CCC AAG CCT CCC AAG CCT GTG AGC AAG ATG CGC ATG GCC 
net Lys Leu Pro Lys Pro Pro Lys Pro Val Ser Lys Met Arg Met Ala 
95 100 105 

ACC CCG CTG CTG ATG CAG GCG CTG CCC ATG GGA GCC CTG CCC CAG GGG 
?S pro Leu Leu Met Gin Ala Leu Pro Met Gly Ala Leu Pro Gin Gly 
115 120 125 



CCC ATG CAG AAT GCC ACC AAG TAT GGC AAC ATG ACA GAG GAC CAT GTG 
pro Mit Sn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp H.s Val 
130 135 1*0 



ATG CAC CTG CTC CAG AAT GCT GAC CCC CTG AAG GTG TAC CCG CCA CTG 
Met is Su Leu Gin Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu 
145 150 

AAG GGG AGC TTC CCG GAG AAC CTG AGA CAC CTT AAG AAC ACC ATG GAG 
Lys Gly ser Phe Pro Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu 

ACC ATA GAC TGG AAG GTC TTT GAG AGC TGG ATG CAC CAT TGG CTC CTG 
Te Sp Trp Lys Val Phe Glu Ser Trp Met His His Trp Leu Leu 

175 



49 



97 



145 



193 



241 



289 



337 



385 



433 



481 



529 



577 
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TTT GAA ATG AGC AGG CAC TCC TTG GAG CAA AAG CCC ACT GAC GCT CCA 
Phe Glu Met Ser Arg Kis Ser Leu Glu Gin Lys Pro Thr Asp Ala Pro 
19S 200 205 

CCG AAA GAG 7CA CTG GAA CTG GAG GAC CCG TCT TCT GGG CTG GGT GTG 
Pro Lys Glu Ser Leu Glu Leu Glu Asp Pro Ser Ser Gly Leu Gly Val 
210 215 220 

ACC AAG CAG GAT CTG GGC CCA GTC CCC ATG TGAGAGCAGC AGAGGCGGTC 
Thr Lys Gln Asp Leu Gly Pro Val Pro Met 
225 230 



625 



673 



723 
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ACC CAG GTC CAC ATG .AAC AAC GTG ACC GTA ACG CTC CAT GAT GCC ACC 
Thr Gln Val His Met Asn Asn Val Thr Val Thr Leu His Asp Ala Thr 
160 165 170 

ATC CAG GCG TAC CTT TCC AAC AGC AGC TTC AGC AGG GGA GAG ACA CGC 
Ile Gln Ala Tyr Leu Ser Asn Ser Ser Phe Ser Arg Gly Glu Thr Arg 
175 180 185 



WO 99/58658 PCT/US99/10646 

CCGCCTCGGC ATG GCG CCC CGC AGC GCC CGG CGA CCC CTG CTG CTG CTA 229 
Met Ala Pro Arg Ser Ala Arg Arg Pro Leu Leu Leu Leu 
1 5 10 

CTG CCT GTT GCT GCT GCT CGG CCT CAT GCA TTG TCG TCA GCA GCC ATG 277 
Leu Pro Val Ala Ala Ala Arg Pro His Ala Leu Ser Ser Ala Ala Met 
15 20 25 

TTT ATG GTG AAA AAT GGC AAC GGG ACC GCG TGC ATA ATG GCC AAC TTC 325 
Phe Met Val Lys Asn Gly Asn Gly Thr Ala Cys Ile Met Ala Asn Phe 
30 35 40 45 

TCT GCT GCC TTC TCA GTG AAC TAC GAC ACC AAG AGT GGC CCC AAG AAC 373 
Ser Ala Ala Phe Ser Val Asn Tyr Asp Thr Lys Ser Gly Pro Lys Asn 
50 55 60 

ATG AC^ TTT GAC CTG CCA TCA GAT GCC ACA GTG GTG CTC AAC CGC AGC 421 
Met Thr Phe Asp Leu Pro Ser Asp Ala Thr Val Val Leu Asn Arg Ser 
65 70 75 

TCC TGT GGA AAA GAG AAC ACT TCT GAC CCC AGT CTC GTG ATT GCT TTT 469 
Ser Cys Gly Lys Glu Asn Thr Ser Asp Pro Ser Leu Val Ile Ala Phe 
80 85 90 

GGA AGA GGA CAT ACA CTC ACT CTC AAT TTC ACG AGA AAT GCA ACA CGT 517 
Gly Arg Gly His Thr Leu Thr Leu Asn Phe Thr Arg Asn Ala Thr Arg 
95 100 105 

TAC AGC GTT CAG CTC ATG AGT TTT GTT TAT AAC TTG TCA GAC ACA CAC 565 
Tyr Ser Val Gln Leu Met Ser Phe Val Tyr Asn Leu Ser Asp Thr His 
110 115 120 125 

CTT TTC CCC AAT GCG AGC TCC AAA GAA ATC AAG ACT GTG GAA TCT ATA 613 
Leu Phe Pro Asn Ala Ser Ser Lys Glu Ile Lys Thr Val Glu Ser Ile 
130 135 140 

ACT GAC ATC AGG GCA GAT ATA GAT AAA AAA TAC AGA TGT GTT AGT GGC 661 
Thr Asp Ile Arg Ala Asp Ile Asp Lys Lys Tyr Arg Cys Val Ser Gly 
145 150 155 



709 



757 
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# 9 

TGT GAA CAA GAC AGG CCT TCC CCA ACC ACA GCG CCC CCT GCG CCA CCC 805 
Cys Glu Gln Asp Arg Pro Ser Pro Thr Thr Ala Pro Pro Ala Pro Pro 
190 195 200 205 



AGC CCC TCG CCC TCA CCC GTG CCC AAG AGC CCC TCT GTG GAC AAG TAC 853 
Ser Pro Ser Pro Ser Pro Val Pro Lys Ser Pro Ser Val Asp Lys Tyr 
210 215 220 



GGC CAG TTT GGC TCT GTG GAG GAG TGT CTG CTG GAC GAG AAC AGC ACG 
Gly Gln Phe Gly Ser Val Glu Glu Cys Leu Leu Asp Glu Asn Ser Thr 
370 375 380 



901 



AAC GTG AGC GGC ACC AAC GGG ACC TGC CTG CTG GCC AGC ATG GGG CTG 
Asn Val Ser Gly Thr Asn Gly Thr Cys Leu Leu Ala Ser Met Gly Leu 
225 230 235 

CAG CTG AAC CTC ACC TAT GAG AGG AAG GAC AAC ACG ACG GTG ACA AGG 949 
Gln Leu Asn Leu Thr Tyr Glu Arg Lys Asp Asn Thr Thr Val Thr Arg 
240 245 250 

CTT CTC AAC ATC AAC CCC AAC AAG ACC TCG GCC AGC GGG AGC TGC GGC 997 
Leu Leu Asn Ile Asn Pro Asn Lys Thr Ser Ala Ser Gly Ser Cys Gly 
255 260 265 

GC CAC CTG GTG ACT CTG GAG CTG CAC AGC GAG GGC ACC ACC GTC CTG 1045 
Ala His Leu Val Thr Leu Glu Leu His Ser Glu Gly Thr Thr Val Leu 
270 275 280 285 

CTC T^" CAG TTC GGG ATG AAT GCA AGT TCT AGC CGG TTT TTC CTA CAA 
Leu Phe Gin Phe Gly Met Asn Ala Ser Ser Ser Arg Phe Phe Leu Gin 
290 295 300 



1093 



GGA ATC CAG TTG AAT ACA ATT CTT CCT GAC GCC AGA GAC CCT GCC TTT 1141 
Gly Ile Gln Leu Asn Thr Ile Leu Pro Asp Ala Arg Asp Pro Ala Phe 
305 310 315 

AAA GCr GCC AAC GGC TCC CTG CGA GCG CTG CAG GCC ACA GTC GGC AAT 
Lys Ala Ala Asn Gly Ser Leu Arg Ala Leu Gln Ala Thr Val Gly Asn 
320 325 330 

TCC TAC AAG TGC AAC GCG GAG GAG CAC GTC CGT GTC ACG AAG GCG TTT 
Ser Tyr Lys Cys Asn Ala Glu Glu His Val Arg Val Thr Lys Ala Phe 
335 340 345 

TCA GTC AAT ATA TTC AAA GTG TGG GTC CAG GCT TTC AAG GTG GAA GGT 
Ser Val Asn Ile Phe Lys Val Trp Val Gln Ala Phe Lys Val Glu Gly 
350 355 360 365 



1189 



1237 



1285 



1333 
CTG ATC CCC ATC GCT GTG GGT GGT GCC CTG GCG GGG CTG GTC CTC ATC 
Leu Ile Pro Ile Ala Val Gly Gly Ala Leu Ala Gly Leu Val Leu Ile 
385 390 395 

GTC CTC ATC GCC TAC CTC GTC GGC AGG AAG AGG AGT CAC GCA GGC TAC 
Val Leu Ile Ala Tyr Leu Val Gly Arg Lys Arg Ser His Ala Gly Tyr 
400 405 410 

CAG ACT ATC TAGCCTGGTG CACGCAGGCA CAGCAGCTGC AGGGGCCTCT 
Gln Thr Ile 
415 






1381 



1429 



1478 
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# t 

M I T F L • 

XOO IXO 120 ^ X30 ^ 140 



*»♦♦♦* 



^TGCTGGGACTCCAAAG^TTTCACATACTGCATCTCCTTCAA 
TACACCTTTCGTGGACAGACAACCTACTACGACCCTGAGGTTTCCTAAAGTGTAT^^^^ 



ATGTGGAAAGCACCTGTCTGTTGGATGA' 

TACACCTTTO 
H V E S 



I C 1 I. D D A G T P K D F T If C I S F ». 



« « ★ * 



* ♦ ♦ 



rAAG^ATCTGCTGACCTGCrGGGATCCAGAGGAGAATAAGATGGCCCCTTGCGAATTTGG^^ 

^SSS^TGGACGACCCTAGGTCTCCTCTTArrCTACC^^ 

XDLLTCWD PEENKMAPCEFevu 



220 230 240 2S0 -- ^ ^ 

♦ ♦•♦** 



260 270 280 

* * • * 



TCGAACCGCTTACAGGAGAGTGTCGTGGAGTTGGTTTTTCTGTGGGACTACGTCGCG^^^ 
SLAN VL3QHLNQKDTL 



M Q R L a M G> 



AGC-rGGCGAATGTCCTC:CACAGCACCTCAACCAAAAAGACACCCTGATGCAGCGCTTG^^^ 

sttggtttttctgtggg; 

N Q K D T 1 

320 ^ 330 ^ 340 ^ 350 

^CA^CcIcAC^^CC^CT^^ 

aagtcttaacacggtgtgtgtgggtcgggaagacccctagtgacttcttctcctct^. 
lqncatetqpfwgsltnrtrf 

400 410 420 

* * 



360 370 380 390 



ocaa;tagccaaaaccac.=cttttaacacgagggagcctgt^^^^^^ 
cgttcatcggttttggtgaggaaaattgtgctccctcggacactacgaccg^^^ 

QVAKTT PFNTREPVMLACX 

ii70 490 490 

430 440 450 460 ^ , * * 

cm 540 SSO 560 

SOO 310 520 530 I . * * * 



TCTGACGGGTCGGGTTACCTCTGACCTGTATGGTCTGGGAGAGGGTAAATCGeAAi T P S Y G> 
KTAQPNGDWTYQTLSHLAi. 
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570 ^ SSO ^ 5.0 ^ eoo ^ .10 ^ 620 ^ «0 
S^^Sx^ScACACCATCTCGTGTMCCCCGAGGACTCGG^^^^ 

660 670 ^ 680 ^ 690 ^ 700 



LGIiGLIir> 



GACAGGGGGTACGTCTGGGACTTCCAAAGACACAGACGTCACTGAGACCCGGACCCGGAGTAGTAGAA^ 
LSPMQTiKVSVSAVT 



750 760 770 

♦ ♦♦**♦ 



710 720 730 740 ^ ^ ^ ^ ^ 



CTCT;GGTGTGATCAGCTGGCGGAGAGCTGGCCACTCTAGrrACA^^ 
GAGAACCACACTAGTCGACCGCCTCTCGACCGGTGAGATCAAT^^^^ 



780 790 
♦ * ♦ * 

AGAAGGATGGCACAnTCCTAG 
TCTTCCTACCGTGTAAAGGATC 
S G W H X S *> 
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An SO 60 70 

M G S G 



WVPW VVALLVNLTQLDSSM* 



30 ^ 100 ^ 110 ^ ^ , 

CTCilGGC;.^GACTCTCciGAAGlTrrT^TGATicAfi^ . 
.S^^CGTGTCTGAGAGGTCTrCTAAAACACTAAGTCCGTTTCCGACTGAO^^^ 

TQGTDSPEDFVIQ AKADCI* 

2KVQFVVR FIFNLEEYVRFUS 

-yen 270 280 

220 230 240 250 260 ^270 ^ ^ 

GMFVALTKL GQPOAEQWNS. 
rERSRQAVDGVCRHNYRLGAP.TV 



360 370 380 



390 



410 420 



CCCCTCTTTTCACGTTGGTCTCCACTGTCACATGGGTCTCTCCT^ «llHQHNL> 

480 490 



G R K y Q ? 



EVTVYPERTP 



460 470 



430 . 440 450 * * * 

« * • ♦ ^ * * 



CTGcIcTGCTCTGTGACAGGCTTCTATC^^^ 

550 560 



GACGTGACGAGACACTGTCCGAAGATAGGTCCCCTA'...- v WFLNGQE> 

LHCSVTGFYPG DiKlKwr 



520 530 540 



500 510 ^" * * * * * 

******* 
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AGAGAGCTGGGGTCATGTCCACTCMCCCTATCAGGAATGGAGACTGGACCTTTCAGACTGTGGTGATC 
™SS^GTACAGGT(^CCGGGATAGTCCTTACCTCTGACCTG<^ 

ERAGVMSTGPIRNGDWTFQTVVML> 

570 580 590 600 610 620 630 

AGAAATGACTCCTGAACnrGGACATGTCTACACCTGCCTTGTCGATCACTCCAGCCrGCTG^^^^ 
TOTASGAGGACTTGAACCTGTACAGATGTGGAOXyvACA^^^^ 

e'mtpeI'Ghvytclvdhssllspv> 



.... 680 690 700 



6S0 660 670 

* * ♦ 

♦ ♦ 



♦ ♦ ♦ * * 



-CTG-GGAGTGGAGAGCTCAGTCrGAATATTCTTGGAGAAAGATGCTGAGTGGCATTGCAGCCTTCCTJ^^ 
;S«SS?ScTCaAGTCAGACTTATAAGAACCTCTrTCTACGACTCACCGTAAC^^^^ 
■ i V V W R A .Q~S " E Y S -W-R- M L S G I A A F L> 

710 720 730 740 750 760 770 

TT-GGCTAATCTTCOTCTGGTGGGAATCGTaXCCAGCTAAGGGCTCAGAAAGGATATGTGAGGACGCA 

HcSSSISIgoaagaccacccttagcagtaggtcgattcccgagt^ 

780 790 800 . 810 820 

********** 

GATGTCTGGTAATGAGGTCTCAAGAGCTGTTCTGCTCCCTCAGTCATGCTAA 

CTACAGACCATTACTCCAGAGrrCTCGACAAGACGAGGGAGTCAGTACGATT 
--•^VSRAVLLPQSC*> 
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iTGCCTGGG^TCcL«3AGTCCTckAGCTCTGCCTGCCACCATCTTCCTCCTCTTC^^^ 

?I?SSS?SS?Stcaggaggttcgagacggacggtgg7agaaggagga 

MPGG PGVLQALPATIFLLFLLSA> 
80 90 100 110 120 ^ "0 ^ 140 

VYLGPGCQALWMHKVPASLMVSb.e> 



160 170 180 190 200 210 

♦ * ♦ * * * 



"0 - 



GGAAGACGCCCACrTCCAATGCCCGCACAATAGO.G(»ACAACGCCAACGTCACCTGGT^^^^ 

^??S^cSSgaaggt.acgggcgtgttatcgtcgttgttgc^^^^^ 



E 



DAHFCCPHNSSNNANVT. WW RVL> 
230 240 250 260 ^ 270 ^ 280 



* * * * 



CATG^CAAciACACGTGGCCCCCXGAGTTCTTGGGCCCGGGCGAGGACCCCAATGGTACGCT^^ 
SI«^S?G?^CACCC^CTCAAGAACCCGGGCCCGCTCC^^^ 
HGNY TW? P S FLGPGED P NG T L 

290 300 ^ 310 ^ 320 ^ 330 ^ 340 ^ 350 

agaa;gtgaIcaagIgcca;gggc;cata;acg4gcc;^^^ 

TCTTACACTTGTTCTCGGTACCCCCGTATATGCACACGGCCCAGGTCCTCCCGTTC^^ 
QNVNK SHGGI YVCRVQEGNES 



360 370 380 390 



400 410 420 

* * 



GTCCTGCGGCACCTACCTCCGCGTGCGCCAGCCGCCCCCCAGGCCCW^^ 
VGGACGCCGTGGATGGAGGCGCACGCGGTCGGCGGGGG- 
SCGTYLRVRQ PPPRP 



CAGGACGCCGTGGATGGAGGCGCACGCGGTCGGCGGGGGGTCCGGGAAGGACCT^^^^ 



430. ■ 440 450 460 470 480 ^ 490 



♦ * * 



iUVGAlcCGAlTCATkcAC-CCGAGGGGATCATCCTCCTGTTCTGCGCGGTGGTGCCTC^^^ 

™™g?g^=ggctcccctagta^^^^ 

KNRIITAEGIILLFCAVVPGTI. 
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550 560 



ISIctcSwCTACCG .,™....T.rr^CCCCTACTTATACTTCTACTTTreGA 

L 



'f"r K RWQNEKLGLDAGD2YEDENL> 



^510 S20 530 

♦ ♦ * * * 
AGAACGAGAAGCTCGGGTTGGAT 

iTCTTGCTCTTCGAGCCCAACCTACGGCCCCTACTTATACTTCTAa 

D 2 Y E D E 

570 580 590 600 610 620 630 

, \ *********** * 

TrATGAAGGCCTGAACCTGGACGACTGCTCCATGTATGAGGACATCTCCCGGGGCCTCCAGGGW^^ 

SSS^SgScttggacctgctgacgaggtacatactcctgtagagggccccggaggt^ 

YEGLNLDDCSMYEDISRGLQGTY* 

tfcn «70 680 690 700 

640 6S0 ^ 660 ^ 670 ^ bhw 

caggItgtg^cagcctcaIcata^tgtccIgctggagaagccgtgacacccctj^^^ 

^^SIScCCGTCGGAGTTGTATCCTCTACAGGTCGACCTCTTCGGCAC^ 

qdvgslnigdvqlekp*> 
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# • 

GAATTCCGCG GTG^CC ATG GCC AGG CTG GCG TTG TCT CCT GTG CCC AGC 
Met Ala Arg Leu Ala Leu Ser Pro Val Pro Ser 
1 5 10 

CAC TGG ATG GTG GCG TTG CTG CTG CTG CTC TCA GCT GAG CCA GTA CCA 
His Trp Met Val Ala Leu Leu Leu Leu Leu Ser Ala Glu Pro Val Pro 
15 20 25 

GCA GCC AGA TCG GAG GAC CGG TAC CGG AAT CCC AAA GGT AGT GCT TGT 
Ala Ala Arg Ser Glu Asp Arg Tyr Arg Asn Pro Lys Gly Ser Ala Cys 
30 35 40 

TCG CGG ATC TGG CAG AGC CCA CGT TTC ATA GCC AGG AAA CGG CGC TTC 
Ser Arg Ile Trp Gln Ser Pro Arg Phe Ile Ala Arg Lys Arg Arg Phe 
45 50 55 

ACG GTG AAA ATG CAC TGC TAC ATG AAC AGC GCC TCC GGC AAT GTG AGC 
Thr Val Lys Met His Cys Tyr Met Asn Ser Ala Ser Gly Asn Val Ser 
60 65 70 

TGG CTC TGG AAG CAG GAG ATG GAC GAG AAT CCC CAG CAG CTG AAG CTG 
Trp Leu Trp Lys Gln Glu Met Asp Glu Asn Pro Gln Gln Leu Lys Leu 
80 85 90 

GAA AAG GGC CGC ATG GAA GAG TCC CAG AAC GAA TCT CTC GCC ACC CTC 
Glu Lys Gly Arg Met Glu Glu Ser Gln Asn Glu Ser Leu Ala Thr Leu 
95 100 

ACC ATC CAA GGC ATC CGG TTT GAG GAC AAT GGC ATC TAC TTC TGC CAG 
Thr Ile Gln Gly Ile Arg Phe Glu Asp Asn Gly Ile Tyr Phe Cys Gln 
110 115 120 

CAG AAG TGC AAC AAC ACC TCG GAG GTC TAC CAG GGC TGC GGC ACA GAG 433 
Gln Lys Cys Asn Asn Thr Ser Glu Val Tyr Gln Gly Cys Gly Thr Glu 
125 130 135 

CTG CGA GTC ATG GGA TTC AGC ACC TTG GCA CAG CTG AAG CAG AGG AAC 481 
Leu Arg Val Met Gly Phe Ser Thr Leu Ala Gln Leu Lys Gln Arg Asn 
140 145 150 155 

ACG CTG AAG GAT GGT ATC ATC ATG ATC CAG ACG CTG CTG ATC ATC CTC 
Thr Leu Lys Asp Gly Ile Ile Met Ile Gln Thr Leu Leu Ile Ile Leu 
160 165 



49 



97 



145 



193 



241 



289 



337 



385 
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TTC ATC ATC GTG CCT ATC TTC CTG CTG CTG GAC AAG GAT GAC AGC AAG 
Phe Ile Ile Val Pro Ile Phe Leu Leu Leu Asp Lys Asp Asp Ser Lys 
175 180 

GCT GGC ATG GAG GAA GAT CAC ACC TAC GAG GGC CTG GAC ATT GAC CAG 
Ala Gly Met Glu Glu Asp His Thr Tyr Glu Gly Leu Asp Ile Asp Gln 
190 195 200 

ACA GCC ACC TAT GAG GAC ATA GTG ACG CTG CGG ACA GGG GAA GTG AAG 
Thr Ala Thr Tyr Glu Asp Ile Val Thr Leu Arg Thr Gly Glu Val Lys 
205 210 215 

TGG TCT GTA GGT GAG CAC CCA GGC CAG GAG TGAGAGCCAG GTCGCCCCAT 
Trp Ser Val Gly Glu His Pro Gly Gln Glu 
220 225 230 



577 



625 



673 



723 
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10 20 30 



50 60 70 

* * * 

GACGGATCGGGAGATCTCCCGATCCCCTATGGT 



» w - * 

***** CGACTCTCAGTACAATCTGCTCTGATGCCGCATAGTT 

CTGAGAGTCAT6TTAGACQAGACTACGGCGTATCAA 



CTGCCTAGCCCTCTAGAGGGCTAGGGGATACCAGC 

100 110 120 ^ 130 ^ 140 

150 160 • 170 180 130 ^ 200 ^ 210 



^^^^^^^^^^^^^^ 



290 300 310 320 ^ ^ ^ ^ 



330 340 3S0 



360 370 



380 390 400 410 420 



CCCAlcGACCCCCG^CCATrGACGTCAATAATC^CGXATaXTC^^^^ 

Sgttgctgggggcgggtaactgcagttattactgcatacaagggtatcattgcggttatccctgaw^ 

530 



570 



580 590 600 



610 620 630 



tgggIcttt;ctac;tggcIgtacItcxa;gtat;agtc;tc^^^^^ 

ACCCTGAAAGGATGAACCGTCATGTA(aiTGCATAATCAGTAGCGATAATGGTACaiCTACGCCAi^ 

Kfio 690 700 

640 650 660 670 ^ 680 ^63^ ^ ^ 
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TCn 7fi0 



710 720 730 



740 750 760 770 



TCSGGAGTTTGTTTTGGCACCAAAATCAACXWGACTTTCCAAAATGTCGTJACAACTCC^^ 
ACCCTCAAACAAAACCGTGGTTTTAGTTGCCCTGAAA<WmTACAfiaV.TTGCT 



780 790 



800 810 820 830 840 



«♦**•* •*• 

CAAATOKSCGGTAGGCGTGTACGGTGGGAGGTCTATATWiGCAGAGCTCTCTGGCTAACTJW^^ 

GTTTACCCGCCATCCGCACATGCCACCCTCCAGATATATTCGTCTCGAGAGACCGATTGATC^^ 



850 860 370 880 



890 900 910 



CTGCrrACTGCCrTATCXSAAATTAATACGACTCACTATAGGGAGACC^^ 

gacgIatgaccgaatagctttaattatgctgagtgatatccctctgggttcgaccgat^ 

920 930 940 950 960 970 980 

cctatagagtctataggcccacccccttggcttcttatgcatgctatactgtttttggot 

SxATCTCAGATATCCGGGrGOGGGAACCGAAGAATACCrrACGATATGACAAAAACCGAACCCC^^ 

990 1000 1010 1020 1030 1040 lOSO 

• 

ACACC-CCGC^CCTCATGTTATAGGTGATGGTATAGCrTAGCCTATAGGTGTGGGTTArrGACCATTAT 

TGTGG^CGAAGGAGTACAA?ATCCACTACCATATCGAATCGGATATCCACACCCAATAACTGGTAATA 
1060 1070 1080 1090 1100 IHO 1120 




1130 1140 IISO 1160 1170 1180 ^ 1190 

«.** • ••♦* * * * 

TTTA-^GGCTATATGCCAATACACTGTCCTTCAGAGACTGACACGGACTCTGTATTTTTACAGGA^ 

AAAT^CCGATATACGGTTATGTGACaGGAAGTCTCTGACTCTGCCTGAGACATAAAAATGTCCTA^^ 

1200 1210 1220 1230 1240 ^ 1250 ^ 1260 

TCTclTTTAirATTCACAAlTTCAkTATACAAOVCCACCGTCCCCAGTGCCCGCAGTTTTTA^ 

ISSIStaataaatgtttaagtgtatatgttgtggtggcaggggtcacgggcgtcaaaaataat^^ 

1270 1280 1290 1300 1310 1320 1330 




1340 1350 1360 ^ 1370 ^ 1330 ^ 1390 ^ 1400 

CTTCTACATCCGAG^CCTG^CCcIxGCcicCRG^CTkTGGTCGCTCGGCAGC^ 

SS^SLctcgggacgagggtacggaggtcgctgagtaccagcgagccgtcgaggaacgaggattg 
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,410 ^1«0 ^X«0 ^X440. ^1450 ^1«0 ^1470 
.^.'.r^rrJ^GAOTAOSttCAG^CGATOICcIcCACCRCCACrrGTGCCGW 



1490 1500 15X0 



1480 - . ^ ^ 



1S20 1S30 1540 



1«. 15.0 IS" 

,„o i«= . . . 

,„o 170. m» '™ 

.„0 177. 17.0 17=0 _1..0 _l.i; 

ift70 1880 1890 
1830 1840 1850 I860 1870 

.300 ISIO 1«0 1930 .^1940 ^X9S0 ^1960 

1370 1980 1990 2000 ^20.0 ^2020 ^2030 

,0.0 .050 2060 2070 ^ 2080 ^ 2090 ^2100 

TCCCCGTTTGTTGTCTACCGACCGrrGATCTOXSTGTCAGCTCCfflVCTM^^ 
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2180 2190 ^ 2200 ^ 2210 ^ 2220 ^ 2230 ^ 2240 

SrC«SSSScCTGTGG;.CACCTCTCTTTCCGTTTCACCTAa^^ 

,„0 ^2260 ^ 2270 ^ 2280 ^2290- ^ 2300 ^2310 

S^SSS?lS?S^^cSTCGAACTGTTGTmTCTA.C^ 

. .320 2330 ^ 2340 ^ 2350 ^ 2360 ^ 2370 ^ 2380 

CGCG^CCACCCTCAiAGGcIrCACCGCGG^CAG^TGAATATCAAATCCTCCTCG™ 

SgSS^SSS^tccgtagtggcgcccggtccacttatagtttaggag^ 

,390 ^2400 ^2410 ^2420 ^2430 ^2440 ^2450 
,™»rrrraGAlGTCA^CCCGCTTTTGAGAiwAGTACTCACCCCAACAGCTGGCCCTCGCA^^ 

^ISSS^^c^SIcgScSaaactc^^^^ 



2470 2480 2490 2S00 ^25X0 ^ 2520 

,S30 2540 2S50 2560 2570 ^ 2580 ^ 2590 

2810 2620 2630 2640 2650 ^ 2660 



^^^•GCGTCAATGGGGCGGAGTTGTrACGACATTTT 

matatctggagggtggcatgtgcggatggcgggtaaacgcagttaccccgcctcaac^^^ 

2680 ^ 2690 ^ 2700 ^ 2710 ^ 2720 ^ 2730 
rrJ^GTCC^TTGl-rrrKTGCCAAAALyvAcicCCA^TGACGTCAATGGGGTGGAG^^ 

S^JSSSSS^aSccacggttttgtttgagggtaactgc^^^^ 

2740 2750 2760 2770 2780 ^ 2790 ^ 2800 



TATATAGACCTCCCACCGTACACGCCTACCGCCCAT 
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2810 ^0 ^ 2830 ^ 2840 ^ 2850 ^ 2860 ^ 2870 
GACTlATACGTAGATGTACTGCCAlGTAG^AAAG^CCCA^AAfiGTCATGTACTG^ 

SS^^Itccatctacatgacggttcatcctttcagggtattccagta^^^ 

2880 2390 ^2900 ^2910 ^2920 ^2930 ^2940 

GGCclTTTACCGTclTTGACGTCAlTAGGiKSCGiACTTGGCATATGATACAC^ 
SS^nATScAGTAACTGCAGTTATCCCCCGCATGAACCGTATACTATGTGAACTACAT^ 

29S0 2960 ^ 2970 ^ 2980 ^ 2990 ^ 3000 ^ 3010 

tWGclGTrrlcCGTLuiTAGTCCA^CCAT^CGTCAATGGAAAGTCCCTAT^^ 
?SS?SSTTrATCAGGTGGGTAACTGCAGTXACCmaGGGAT^^^ 

3020 3030 , ^ 3040 ^ 30S0 ^ 30S0 ^ 3070 ^ 3080 

ATAC^CAT^ArrGlcGTcL^TGG^GGG^CGirGGG^TCAGCCAGGCGGGCCA^ 
S^SSSTAACTGCAGTTACCCGCCCCCAGCAACCCGCCAGTCGGTCCGCa^ 

3090 ^3100 ^3X10 ^3120 ^3X30 "^3140 ^3150 

TATGTAACG^GGAACTCCAkTAT^CrlTGAA^AATCACCCCGTAATTGATTACTA^ 
SS??^SS?GAGGTA-ATACCCGATACrrGATTACTGGGGCAW^^ 

3170 ^3X80 ^3190 ^3200 ^32X0 ^3220 
TCAATAATclATGTCCTGclTrAATGAATCGGCcL.CGCGCGGGkGAGGCGGTTTGCOT^^ 

2??I??SSSacgtaattacttagccggttgcgcgcccctctccgc^^^ 



3230 



3240 3250 3260 3270 3280 3290 



CTTCCGCTTCCTCGCTCACTGACT^GCTGCGCTcisTCG^CGGCTGCGGCGAGCGGTAT^^ 

SSSIgSScgagtgactgagcgacgcgagccagcaagccgacgccg^^^^ 

3300 ^3310 ^3320 ^3330 ^3340 ^3350 ^3360 

^g^cggtIatac^atccacIgaat^ggg^taacgcaggaaagaacatgtgagc^^ 

^SSSSSxAGGTGTCTTAGTCCCCTArrGCGTCCTTTCTTGTAC^^^ 

3380 ^3390 ^3400 ^34X0 ^3420 ^3430 

caaaIggccIggaaccgtaIaaaggccgcgttgc^ggcg^ccatag^^ 

STTTCCGGTCCTTGGCATTTrTCCGGCGCAACGACCGCAAAAAGGTATCCGAGGCGGGO^ 

3440 34S0 3460 3470 ^ 3480 ^ 3490 ^ 3500 

atca^tcgacgctcLgtcLagg^ggcgLcccgacaggactata^^ 

ilSSTTTTAGCTGCGAGrrCAGTCTCCACCGCTTrGGGC^^ 
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3510 



3S2C 3S30 3540 3550 3560 3570 



CCCTGGAAGCrCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCG(aTACCTGTCCGCCTTTCT^ 
cSSSSSAGCACGCGAGiWMACAAGGCTGGGACGGCGAATGGCC^^ 

3580 3590 3600 3610 3620 3630 3640 

SHSSJSS^SSSS^IcSSSS^S^ISctSIg^ 

36S0 36S0 3670 3680 3690 ^ 3700 ^ 3710 

3720 3730 3740 3750 3760 3770 3780 

• * 

TGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCG 

Ic?SS?S?^ttctgtgctgaatagcggtcaccgtcgtcggtc^^ 

3790 3800 3810 3820 . 3330 3840 38S0 

. ; * . . * * * 

ifiCTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCrAACTACGGCTACACTAGAAGGACAGTAT 

tcSSt^SSSItgtctcaagaacttcaccaccggattgatgccgatgt^^^ 

3860 3870 388,0 3890 ^ 3900 ' ^ 3910 ^ 3920 

TTGG^ATCTCCGCrJTGCTiuVGC^AGTTlcCTT^AAL^AGTTGGTAGCTCTTGATK 
SS^GSAGACGACrrCGOTCAATGGAAGCCTTTTTCTO^^ 

3930 3940 3950 ^3960 ^ 3970 ^ 3980 J990 

AACcIcCGCTGGTAGCGGTLTTTiTTTG;TTG«i«.CA^CA^^^ 

TTGGTGGCGACCATCGCCACCAAAAAAACAAACGTTCGTCGTCTAATGCGCGTCTTTTTTTCCTA^^ 

4000 4010 4020 4030 ^ 4040 ^ 4050 ^ 4060 

gaagatc^at^Jtacg^k^tc^gacg^cag^ggaj^ 

CTTCTAGGAAACTAGAAAAGATGCCCCAGACTGCGAGTCACarrGCTTTTGAGTGCAATTCCCTA^ 

4070 4080 4090 ^ 4100 ^ 4110 ^ 4120 ^ 4130 

tcatgaacaItaaaIctgtctgct^acatIaacactaatIcaaggggtgttatgagcc^^^^ 
ISIS^^ISttgacagacgaatgtatttgtcattatgttccccacaat^^^ 

4150 ^4160 ^4170 ^4180 ^4190 ^4200 

AAACGTCTTGCTCGlGGCcizGAT^AAATicCAAkTGGlTGCTGATl^ATAT^^ 
mGSScGAGCTCCGGCGCTAAmAAGGTTGTACCTACGACTAAATATACCCATArn^ 
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4220 4230 4240 42S0 4260 " 4270 



oyiTLlTGTCCWGCWlTCAaSTGckcAATCTATCGATrGTATGGGAAGCTC^^ 
OTli^ASGCCCGTTAGTCCACGCTGTTAGATAGCrAACATACCCTTCGGGCT^^ 

4280 4290 4300 4310 4320 4330 4340 

ctgaIacatggcaaIggtagcgttgccaatgatgttacagatgagatgg^ 

SS??SIcCCTWCCATCGCAACGGTTACTACAATGTCTACTCTAC^^ 

4350 4360 4370 4380 4390 4400 4410 

. * * 

AAriTATGCCTCTTCCGACCATCAAGCATTTrATCCGTACTCCTGATGATGCATGGTTJ^ 

J^ASTACGGAGAAGGCTGGTAGTTCGTAAAATAGGaTGAGGACTACTACGTACCAAT^^ 

4420 4430 4440 4450 4460 4470 4480 

GAT-CCCGGGAAAAOIGCATTCCAGGTATTAGAAGAATATCCTGATTCAGGTGAAAATATTGTT^ 
SISSScTTTTGTCGTAAGGTCCATAATCTTCTTATAGGACTAAGTCCACm 



4490 



4S00 4S10 4S20 4S30. 4S40 4550 



♦ ♦ * * 



r--GC5cy^GTGTTCCTGCGCCGGrrGCATTCGATTCCTGTrrGTAATTGTCCTTTTAACAGCGATCGCCT^^ 
SSSIScGCGGCCAACGTAAGCTAAGGACAAACATTAACAGGAAAATTCTCGC^^^ 

4560 4S70 4580 4590 ^ 4600 ^ 4610 ^ 4620 

ttcgtctcgctcag^cgcaItcacgaatgIataa^ggtttggtigatgcg^ 

AAGCAGAGCGAGTCCGCGTTAGTGCTTACTTAT7GCCAAAC<»ACTACGCTCACTAAAACTACTG^ 

4640 4650 4660 4670 4680 4690 



4630 



TRATGGCrGGCCTGrrGAACAAGTCTGGAAAGAAATGCRTAAACTTTTGCCATTCTCArc 
ISASScCGGACAACTTGrrCAGACCTTTCTTTACGTArrTGAAAACGGTAAGAGTGGCCTA^ 

4700 4710 4720 4730 4740 4750 • ^ 4760 

... ♦ • ♦ ♦ ♦ . ♦ * * * 

GTCACTCATGGTGATrrCTCACTTGATAACCTTATTTTTGACGAGGGGAAATTAATAGGTTGTATTGi^ 

SSSSISSSLagtgaactattggaataaaaactgctcccctttaattatcca^^^ 

4770 4780 4790 4800 4810 ^ 4820 ^ 4830 

4840 4850 4860 4870 4880 4890 4900 

********** 

TCCrTCATTACAGAAACGGCTTTTTCAAAAATATGGTATTGATAATCCTGATATGAAT;^ 

SSJStotctttgccgaaaaagt^ataccataactact 



FIGURE 19 CONTINUED 









4910 



4920 4930 4940 49S0 4960 4970 



OVTTTCSATGCTCGATaWJTTrTTCTAATCAGAATTG^^ 

4980 4990 5000 SOlO S020 ^ 5030 ^ 5040 

GCCMATACAWTTTGAATCTATTTAGAAAAATIWACAWITJWXKSGT^^^ 
ScCTATGTATAAACTTACATAAATCTTTTTATTTGTTTATCCCCAAOGCGCIK^ 

soso 

• * 

GCCACCTGACGTC 
CGGTGGACTGCAG 
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20 30 40 SO 60 70 

^ ^ * * ♦ ♦ ♦ ♦ ♦ ♦ * * * * 

GCTAGCGCCGCCACCATGGGAATGCAGGTGCAGATCCAGAGCCTGm 
CGATCGCGGCGGTGGTACCCTTACGTCCACGTCTAGGTCTCGGAaUVAGACGAOSAG 

MGMQVQIQSLFLLLLWVP> 



80 90 100 110 120 130 140 

^^************ 

GGTCCAGAGGACACACCCTGTGGAAGGCCGGAATCCTGTATAAGGCCAA 
CCAGGTCTCCTGTGTGGGACACCTTCCGGCCTTAGGACATATTCCGGCT 

GSRGHTLWKAGILYKAKFVAAWTL> 
150 160 170 180 190 200 210 



* ★ ♦ ★ 



* * 



GAAGGCTGCCGCTTTCCTGCCTAGCGATTTCTTTCCTAGCGTGAAGCTGACCCCACTGTGCGTGACCCTG 
CTTCCGACGGCGAAAGGACGGATCGCTAAAGAAAGGATCGCACTTCGACTGGGGTGACACGCACTGGGAC 
KAAAFLPSDFFPSVKLTPLCVTL> 

220 230 240 250 260 270 280 

***** *****, 
TATATGGATGACGTGGTGCTGGGAGCCAGCATCATCAACTTCGAGAAGCTGGGACTGTCCAGATACGTGG 
ATATACCTACTGCACCACGACCCTCGGTCGTAGTAGTTGAAGCTCTTCGACCCTGACAGGTCTATGCACC 
YMDDVVLGASIINFEKLGLSRYV> 



290 300 310 320 



330 340 350 



♦ * ♦ 



***** 



CTAGGCTGATCCTGAAGGAGCCTGTGCACGGCGTGTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGAC 
GATCCGACTAGGACTTCCTCGGACACGTGCCGCACAGGTGGGACGGTCTCTGGTGGCACCACTCCTCCTG 
ARLILKEPVHGVSTLPETTVVRRT> 

360 370 380 390 400 410 

♦ **.*♦•♦**** 

CGTGTACTATGGAGTGCCTGTGTGGAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGGGTACC 

GCACATGATACCTCACGGACACACCTTCACCGACTCGGACGACCACGGGAAACACCCATGG 
VYYGVPVWKWLSLLVPFVGT> 
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10 20 30 40 



50 60 70 



********* 



GCTAGCGCCGCCACCATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCT^^^ 

StcSggcggtggtacccttacgtc^^^ 

MGMQVQIQSLFLLLLWVP> 



80 90 100 110 



♦ * ♦ 



* * * ♦ 



120 130 140 

GSRGHTLWKAGILYKAKFVAAWTL> 



150 160 170 180 190 200 210 



GAAGGCTGCCGCTTTCCTGCCTAGCGATTTCrrrTCCTAGCGTGAAGCTGACCCCA^^ 
S^^ScGGCGAAAGGACGGATCGCTAAAGAAAGGATCGCACTTCGACTGGGG^^ 

KAAAFLPSDFFPSVKLTPLCVTL> 

220 ^ 230 ^ 240 ^ 250 ^ 260 _ 270 ^ 280 

tatatcwtgacgtggtgctgggagtgggIctgtccaggtacgtggctaggctgatcctg^ 
SataSactgo^ccacgaccctcaccctgaoiggtccatgcaccgatccgactaggacw 

YMDDVVLGVGLSRYVARLILKEP> 
290 300 310 320 330 340 350 

« * ♦ * * ♦ * ♦ 

tgcacggcgtctccIccctcccagIgaccaccgtggtgaggaggaccgtgtactatggagtgcctc 
SSgSSSLtgcgacggtctctggtggcaccacxcctcctggca^^^ 

VHGVSTLPETTVVRRTVYYGVPV 



360 370 380 390 

♦ ♦♦♦**♦* 

gaagtggctgagcctgctggtgccctttgtgtgaggtacc 
cttcaccgactcggacgaccacgggaaacacactccatgg 

KWLSLLVP FV*> 
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